文章摘要
张建伟,陶亮,周健,王华彬.噪声谱估计算法对语音可懂度的影响[J].声学技术,2015,34(5):424~430
噪声谱估计算法对语音可懂度的影响
Effects of noise spectrum estimation algorithms on speech intelligibility
投稿时间:2014-12-15  修订日期:2015-03-29
DOI:10.16300/j.cnki.1000-3630.2015.05.009
中文关键词: 噪声谱估计  谱减法  时频块  最小统计  短时客观可懂度  语音可懂度
英文关键词: noise spectrum estimation  spectrum subtraction  time-frequency blocks  Minima Statistics(MS)  Short-Time Objective Intelligibility(STOI)  speech intelligibility
基金项目:国家自然科学基金(61301219、61003131)、安徽省自然科学基金(1408085MF113)资助项目。
作者单位E-mail
张建伟 安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031 zhangjianwei.i.123@163.com 
陶亮 安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031  
周健 安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031  
王华彬 安徽大学计算智能与信号处理教育部重点实验室, 安徽合肥 230031  
摘要点击次数: 1659
全文下载次数: 2900
中文摘要:
      噪声谱估计是单通道语音增强算法的关键步骤,当前大部分语音增强算法旨在提高语音质量,提高语音可懂度的算法却很少。在传统的单通道语音增强算法中,语音质量的提高往往是以牺牲语音的可懂度为代价的。对目前主流的几种噪声谱估计算法对语音可懂度影响进行分析。在不同噪声背景、不同信噪比情况下进行噪声谱估计,并采用谱减法对含噪语音信号作去噪处理,对比分析不同噪声、不同信噪比下增强前后语音的短时客观可懂度 (Short-Time Objective Intelligibility, STOI)值,最后根据信噪比,对比分析了不同噪声环境下,语音增强前后语音能量高于噪声能量的时频块所占比例。实验表明,相比其他噪声估计算法,最小统计(Minima Statistics, MS)算法由于保留了更多的以语音能量为主的时频块,使得去噪后的语音有较高的可懂度。
英文摘要:
      Noise spectrum estimation is a key step in single channel speech enhancement algorithms. Most of current speech enhancement algorithms are designed to improve speech quality, however, algorithms for increasing speech intelligibility are few. The traditional speech enhancement algorithms improve speech quality, while sacrificing speech intelligibility. In this paper, classical noise spectrum estimation algorithms are evaluated for their effects on speech intelligibility. Noise spectrum is estimated in different noise environments with SNRs between -9 dB and 3 dB. The spectral subtraction is thereafter used for speech denoising. The STOI(Short-Time Objective Intelligibility) value of the enhanced speech is computed. At last, according to the signal-to-noise ratio, the proportions of speech dominated time-frequency blocks under different noise environments are analyzed. Experimental results show that, compared with other noise estimation algorithms, the minimum statistics (MS) obtains high speech intelligibility because it retains more speech dominated time-frequency blocks after speech denoising.
查看全文   查看/发表评论  下载PDF阅读器
关闭