文章摘要
沈彩凤,俞一彪.一种新的汉语连续语音声调评测算法[J].声学技术,2013,32(4):305~311
一种新的汉语连续语音声调评测算法
A novel tone evaluation algorithm for Chinese continuous speech
投稿时间:2012-05-10  修订日期:2012-08-27
DOI:10.3969/j.issn1000-3630.2013.04.010
中文关键词: 声调评测  连续语音  Spline插值  Fujisaki模型  高斯混合模型
英文关键词: tone evaluation  continuous speech  Spline interpolation  Fujisaki-model  Gaussian Mixture Model
基金项目:国家自然科学基金资助项目(61271360);苏州市应用基础研究计划资助项目(SYG201230)
作者单位E-mail
沈彩凤 苏州大学电子信息学院语音技术研究室, 江苏苏州 215006 mantianxing45610@126.com 
俞一彪 苏州大学电子信息学院语音技术研究室, 江苏苏州 215006  
摘要点击次数: 1752
全文下载次数: 2526
中文摘要:
      提出一种新的连续语音的声调评测算法,该算法可应用于计算机辅助语言学习系统和普通话水平测试中的声调评测。考虑到连续语音声调受上下文之间的相互影响,采用三音节单元建立高斯混合模型(Gaussian Mixture Model, GMM),三音节中辅音部分用Spline插值法拟合声调曲线来反映音节间基音频率的转移信息,并利用Fujisaki模型去除语句的语调和说话人个性特征,只对基频曲线中的声调特征建模。实验结果显示,相比于传统方法,采用三音节Spline插值和Fujisaki改进特征的方法使得机器与人工打分的相似度在测试集中分别提高了8.75%和14.09%。
英文摘要:
      A new algorithm of objective tone evaluation for Chinese mandarin continuous speech is proposed, which can be used for the tone pronunciation training in Computer Assisted Language Learning (CALL) system and the test of Chinese mandarin speech named as Putonghua Shuiping Ceshi (PSC). A syllable's tone is influenced by context in continuous speech. Therefore, it is reasonable to use tri-syllables as basic units to train GMM (Gaussian Mixture Model) of tones. To get the transition information from the previous voiced region to the current one or from the current to the next voiced region, the pitch value of unvoiced region is interpolated with Spline function. Based on the Fujisaki model, only the lexical tone from the F0 contour is extracted to train GMM. The experimental results show that the correlations between subject and object evaluations based on Spline interpolation and Fujisaki model are improved by 8.75% and 14.09% respectively, comparing to the traditional features.
查看全文   查看/发表评论  下载PDF阅读器
关闭