欢迎访问《声学技术》编辑部！

文章摘要

王群,曾庆宁,谢先明,郑展恒.低信噪比环境下的语音识别方法研究[J].声学技术,2017,36(1):50~56

低信噪比环境下的语音识别方法研究

Research on speech recognition in low SNR environment

投稿时间：2016-07-20 修订日期：2016-09-29

DOI：10.16300/j.cnki.1000-3630.2017.01.010

中文关键词: 语音增强低信噪比改进维纳滤波对数最小均方误差算法语音识别

英文关键词: speech enhancement low SNR modified Wiener filter LogMMSE algorithm speech recognition

基金项目:国家自然科学基金（61461011）、教育部重点实验室2016年主任基金（CRKL160107）资助项目

作者	单位	E-mail
王群	桂林电子科技大学信息与通信学院, 广西桂林 541004
曾庆宁	桂林电子科技大学信息与通信学院, 广西桂林 541004
谢先明	桂林电子科技大学信息与通信学院, 广西桂林 541004
郑展恒	桂林电子科技大学信息与通信学院, 广西桂林 541004	glzzh@guet.edu.cn

摘要点击次数: 1167

全文下载次数: 1534

中文摘要:

单通道语音信号在信噪比较大的环境下经过增强后再识别，能表现出较高的识别率。但是在低信噪比环境下，增强后语音信号的识别率急剧下降。针对此种情况，提出了一种用在识别系统前端的语音增强算法，该增强算法将采集到的带噪语音信号先使用对数最小均方误差（Logarithmic Minimum Mean Square Error，LogMMSE）提高其信噪比，然后再利用改进的维纳滤波去除噪声残留并提升语音可懂度，最后用梅尔频率倒谱系数（Mel-Frequency Cepstral Coefficients，MFCC）和隐马尔科夫模型（Hidden Markov Model，HMM）对增强后的语音信号做特征提取并识别。实验分析结果表明，该方法能有效地抑制背景噪声并减少噪声残留，显著提升低信噪比环境下语音识别的准确性。

英文摘要:

The accuracy rate of single channel enhanced speech recognition in high SNR environment is acceptable, but not so in low SNR environment. In this case, speech enhancement based on logarithmic minimum mean square error (LogMMSE) algorithm and modified Wiener filter algorithm is presented. Firstly the gathered speech signals' SNR is improved by the LogMMSE algorithm. Then using the improved Wiener filter algorithm removes residual noise and improves the signal quality. Finally the enhanced speech is used for recognition by MFCC and HMM algorithms. Experimental results show that the proposed method can effectively remove the background noise and reduce the residual noise, significantly increase the accuracy of the automatic speech recognition in noisy environment.

查看全文查看/发表评论下载PDF阅读器

关闭