Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis
Wen, Zhengqi; Kawahara, Hideki; Tao, Jianhua; Zhengqi Wen
2012
会议日期2012
会议地点美国
关键词Speech Parametric Representation Pitch-scaled Analysis Voicing Cut-off Frequency Principal Component Analysis
页码374-377
英文摘要The typical problem in LPC-like vocoder is buzzing sound which is mainly due to the simple pulse train or noise excitation model. One way to improve it is to reconstruct the residual obtained from inverse filtering. So a new parametric representation of speech based on pitch-scaled analysis is proposed in this paper. Pitch-scaled analysis is used to extract the periodic spectrum of residual with half pitch period length. Then these periodic spectrums are de-correlated by principal component analysis (PCA) to reduce their dimension. Aperiodic measure is defined as the harmonic-to-noise ratio in the frequency domain where voicing cut-off frequency (VCO) is used to control the smoothness of aperiodicity. Periodic spectrum and aperiodic measure together with F0 are indicated as excitation parameters in the proposed LPC vocoder. Experimental results show that this proposed vocoder can get a mean opinion score (MOS) of 4.1 for a female voice before dimensionality reduction and keep the high-quality property after parameter compression.
会议录Annual Conference of the International Speech Communication Association (INTERSPEECH)
内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/41278]  
专题模式识别国家重点实验室_智能交互
通讯作者Zhengqi Wen
推荐引用方式
GB/T 7714
Wen, Zhengqi,Kawahara, Hideki,Tao, Jianhua,et al. Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis[C]. 见:. 美国. 2012.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace