文章摘要
郭乐乐,曹辉,李涛.有效特征参数分类正常与病理语音[J].声学技术,2019,38(5):554~559
有效特征参数分类正常与病理语音
Classification of normal and pathological speech by effective feature parameters
投稿时间:2018-05-07  修订日期:2018-07-18
DOI:10.16300/j.cnki.1000-3630.2019.05.012
中文关键词: 残差信号  基音幅值  频谱平坦度  倒谱峰值突出  支持向量机
英文关键词: residue signal  pitch amplitude (PA)  spectral flatness of the residue signal (SFR)  cepstral peak prominence (CPP)  support vector machine
基金项目:国家自然科学基金资助项目(11074159、11374199)
作者单位E-mail
郭乐乐 陕西师范大学物理学与信息技术学院, 陕西西安 710100  
曹辉 陕西师范大学物理学与信息技术学院, 陕西西安 710100 caohui@snnu.edu.cn 
李涛 陕西师范大学物理学与信息技术学院, 陕西西安 710100  
摘要点击次数: 705
全文下载次数: 817
中文摘要:
      采用残差信号的特征参数——基音幅值(Pitch Amplitude,PA)和频谱平坦度(Spectral Flatness of the Residue Signal,SFR)与语音信号倒谱域特征参数——倒谱峰值突出(Cepstral Peak Prominence,CPP)来区分正常与病理语音,在萨尔布吕肯语音数据库中选择自然音调的正常与病理语音/a/进行仿真实验。统计结果表明,与正常语音相比,病理语音的PA较小,SFR更接近零,CPP也较小。结合其他传统特征参数分析对比,证明SFR、PA和CPP更能有效分类正常与病理语音。通过不同分类算法比较,得出支持向量机的分类准确率相对更高。
英文摘要:
      The feature parameters PA (pitch amplitude) and SFR (spectral flatness of the residue signal) and the vowel cepstrum domain feature parameter CPP (cepstral peak prominence) are used to distinguish between normal and pathological speeches. In the Saarbruecken Voice Database, 216 normal and 216 pathological natural tones/a/are selected for experiments. The statistical results show that compared with normal speech, the PA value of pathological speech is smaller, the SFR value is close to zero, and the CPP value is also smaller. Combined with other features analysis and comparison, it is proved that SFR, PA, and CPP are excellent and stable feature parameters for normal and pathological speech classification. The classification accuracy obtained by support vector machine is relatively higher by the comparison of different classification algorithms.
查看全文   查看/发表评论  下载PDF阅读器
关闭