欢迎访问《声学技术》编辑部！

文章摘要

沈彩凤,俞一彪.一种新的汉语连续语音声调评测算法[J].声学技术,2013,32(4):305~311

一种新的汉语连续语音声调评测算法

A novel tone evaluation algorithm for Chinese continuous speech

投稿时间：2012-05-10 修订日期：2012-08-27

DOI：10.3969/j.issn1000-3630.2013.04.010

中文关键词: 声调评测连续语音 Spline插值 Fujisaki模型高斯混合模型

英文关键词: tone evaluation continuous speech Spline interpolation Fujisaki-model Gaussian Mixture Model

基金项目:国家自然科学基金资助项目(61271360);苏州市应用基础研究计划资助项目(SYG201230)

作者	单位	E-mail
沈彩凤	苏州大学电子信息学院语音技术研究室, 江苏苏州 215006	mantianxing45610@126.com
俞一彪	苏州大学电子信息学院语音技术研究室, 江苏苏州 215006

摘要点击次数: 1781

全文下载次数: 2557

中文摘要:

提出一种新的连续语音的声调评测算法,该算法可应用于计算机辅助语言学习系统和普通话水平测试中的声调评测。考虑到连续语音声调受上下文之间的相互影响,采用三音节单元建立高斯混合模型(Gaussian Mixture Model, GMM),三音节中辅音部分用Spline插值法拟合声调曲线来反映音节间基音频率的转移信息,并利用Fujisaki模型去除语句的语调和说话人个性特征,只对基频曲线中的声调特征建模。实验结果显示,相比于传统方法,采用三音节Spline插值和Fujisaki改进特征的方法使得机器与人工打分的相似度在测试集中分别提高了8.75%和14.09%。

英文摘要:

A new algorithm of objective tone evaluation for Chinese mandarin continuous speech is proposed, which can be used for the tone pronunciation training in Computer Assisted Language Learning (CALL) system and the test of Chinese mandarin speech named as Putonghua Shuiping Ceshi (PSC). A syllable's tone is influenced by context in continuous speech. Therefore, it is reasonable to use tri-syllables as basic units to train GMM (Gaussian Mixture Model) of tones. To get the transition information from the previous voiced region to the current one or from the current to the next voiced region, the pitch value of unvoiced region is interpolated with Spline function. Based on the Fujisaki model, only the lexical tone from the F0 contour is extracted to train GMM. The experimental results show that the correlations between subject and object evaluations based on Spline interpolation and Fujisaki model are improved by 8.75% and 14.09% respectively, comparing to the traditional features.

查看全文查看/发表评论下载PDF阅读器

关闭