A Dynamic-Static Dual-Input Deep Neural Network Algorithm for Diagnosing COVID-19 by Cough
(Original Chinese title: 基于动静态特征双输入神经网络的咳嗽声诊断COVID-19算法)


Abstract Coronavirus disease 2019 (COVID-19) has had a severe impact worldwide, and researchers have studied epidemic prevention and control extensively. Diagnosing COVID-19 from cough sounds is non-contact, low-cost, and easy to carry out, but such research remains relatively scarce in China. Mel-frequency cepstral coefficient (MFCC) features represent only the static characteristics of a sound, whereas first-order difference MFCC features also reflect its dynamic characteristics. To better prevent and treat COVID-19, this paper proposes a cough-based COVID-19 diagnosis algorithm built on a neural network with two inputs, one for static features and one for dynamic features. Using the Coswara dataset, the cough audio is clipped, MFCC and first-order difference MFCC features are extracted, and a dynamic-static dual-input neural network model is trained. The model uses a statistics pooling layer, so it accepts MFCC features of different lengths. Experimental results show that, compared with existing models, the proposed algorithm significantly improves recognition accuracy, recall, specificity, and F1-score.
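The two building blocks named in the abstract, first-order difference features and statistics pooling, can be sketched as follows. This is a minimal illustration using only the Python standard library, not the authors' implementation: the record does not describe the network architecture, so the function names `delta` and `statistics_pooling`, the regression window `width=2`, the edge padding, and the toy feature matrices are all assumptions. The delta uses the common regression formula; the paper may use a plain frame-to-frame difference instead.

```python
import math


def delta(frames, width=2):
    """First-order delta of a (time x coefficient) feature matrix.

    Assumed formula (common regression form, not confirmed by the paper):
        d_t = sum_{n=1..width} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n} n^2)
    Frames beyond either end are replaced by the nearest edge frame.
    """
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(n_frames):
        row = []
        for k in range(n_coeffs):
            acc = 0.0
            for n in range(1, width + 1):
                nxt = frames[min(t + n, n_frames - 1)][k]  # edge padding
                prv = frames[max(t - n, 0)][k]
                acc += n * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out


def statistics_pooling(frames):
    """Collapse the time axis into per-coefficient mean and standard deviation.

    The result always has 2 * n_coeffs entries, whatever the number of
    frames, which is why such a layer accepts inputs of different lengths.
    """
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    stds = [
        math.sqrt(sum((f[k] - means[k]) ** 2 for f in frames) / n_frames)
        for k in range(n_coeffs)
    ]
    return means + stds


# Toy stand-ins for MFCC matrices of two cough clips with different lengths
# (3 coefficients per frame; real MFCCs would come from an audio front end).
short_clip = [[0.1 * t, 1.0, -0.5 * t] for t in range(4)]
long_clip = [[0.1 * t, 1.0, -0.5 * t] for t in range(9)]

for clip in (short_clip, long_clip):
    static_vec = statistics_pooling(clip)          # static branch input
    dynamic_vec = statistics_pooling(delta(clip))  # dynamic branch input
    print(len(clip), len(static_vec), len(dynamic_vec))
```

Both clips yield pooled vectors of the same length even though their frame counts differ. In a dual-input model, the static and dynamic vectors would feed separate branches before being combined; how the paper combines them is not stated in this record.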
Source Acta Electronica Sinica, 2023, 51(1): 202-212 [Core Collection]
DOI 10.12263/DZXB.20211630
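Keywords deep learning; cough sound; COVID-19; Mel-frequency cepstral coefficients; audio technology; convolutional neural network
Address School of Information, North China University of Technology, Beijing, 100144

Language Chinese
Document type Research article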
ISSN 0372-2112
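Subject Electronic technology and communication technology; automation technology and computer technology
Funding National Key Research and Development Program
Accession number CSCD:7419794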

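Citing documents 1

1 Li Zhiying. Detection of mood phases in patients with bipolar disorder based on deep-learning speech analysis. Chinese Journal of Psychiatry, 2024, 57(4): 207-212
Cited in CSCD: 1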
