Semi-supervised Ladder Networks for Speech Emotion Recognition | |
Tao, Jianhua [1,2,3]; Huang, Jian [1,2]; Li, Ya [1]; Lian, Zheng [1,2]; Niu, Mingyue [1,2] | |
Journal | International Journal of Automation and Computing |
2019-03 | |
Volume | 16, Issue: 4, Pages: 437-448 |
Keywords | Speech emotion recognition; the ladder network; semi-supervised learning; autoencoder; regularization |
Abstract | As a major component of speech signal processing, speech emotion recognition has become increasingly essential to understanding human communication. Benefiting from deep learning, many researchers have proposed various unsupervised models to extract effective emotional features and supervised models to train emotion recognition systems. In this paper, we utilize semi-supervised ladder networks for speech emotion recognition. The model is trained by jointly minimizing a supervised loss and an auxiliary unsupervised cost function. The unsupervised auxiliary task provides powerful discriminative representations of the input features and also acts as a regularizer for the supervised emotion recognition task. We also compare the ladder network with other classical autoencoder structures. The experiments were conducted on the interactive emotional dyadic motion capture (IEMOCAP) database, and the results show that the proposed method achieves superior performance with a small amount of labelled data and outperforms the other methods. |
Content type | Journal article |
Source URL | [http://ir.ia.ac.cn/handle/173211/39297] |
Collection | National Laboratory of Pattern Recognition_Intelligent Interaction |
Corresponding author | Tao, Jianhua |
Author affiliations | 1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China; 2. School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China; 3. CAS Center for Excellence in Brain Science and Intelligence Technology, Beijing, China |
Recommended citation (GB/T 7714) | Tao, Jianhua,Huang, Jian,Li, Ya,et al. Semi-supervised Ladder Networks for Speech Emotion Recognition[J]. International Journal of Automation and Computing,2019,16(4):437-448. |
APA | Tao, Jianhua, Huang, Jian, Li, Ya, Lian, Zheng, & Niu, Mingyue. (2019). Semi-supervised Ladder Networks for Speech Emotion Recognition. International Journal of Automation and Computing, 16(4), 437-448. |
MLA | Tao, Jianhua, et al. "Semi-supervised Ladder Networks for Speech Emotion Recognition." International Journal of Automation and Computing 16.4 (2019): 437-448. |
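
The abstract describes training by minimizing a supervised loss together with an auxiliary unsupervised cost. The sketch below illustrates one way such a combined objective could be computed for a single example. It is not the authors' implementation: the abstract does not specify the loss forms, so the cross-entropy supervised term, the per-layer mean-squared reconstruction term, and the names `combined_loss`, `clean_layers`, `denoised_layers`, and `layer_weights` are all assumptions made for illustration.

```python
import math
from typing import Optional, Sequence


def cross_entropy(logits: Sequence[float], label: int) -> float:
    """Negative log-likelihood of `label` under softmax(logits)."""
    m = max(logits)  # subtract the max before exponentiating, for stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def mse(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def combined_loss(
    logits: Sequence[float],
    label: Optional[int],
    clean_layers: Sequence[Sequence[float]],
    denoised_layers: Sequence[Sequence[float]],
    layer_weights: Sequence[float],
) -> float:
    """Supervised loss plus weighted unsupervised reconstruction cost.

    Assumed forms (not taken from the source): cross-entropy for the
    supervised term, and a weighted sum of per-layer squared errors
    between clean activations and their denoised reconstructions for
    the unsupervised term. Unlabelled examples (label is None)
    contribute only the unsupervised term.
    """
    supervised = cross_entropy(logits, label) if label is not None else 0.0
    unsupervised = sum(
        w * mse(clean, denoised)
        for w, clean, denoised in zip(layer_weights, clean_layers, denoised_layers)
    )
    return supervised + unsupervised


# Hypothetical values: a labelled example and the same example unlabelled.
labelled = combined_loss([0.0, 0.0], 0, [[1.0, 2.0]], [[1.0, 4.0]], [0.5])
unlabelled = combined_loss([0.0, 0.0], None, [[1.0, 2.0]], [[1.0, 4.0]], [0.5])
```

Because unlabelled examples still contribute the reconstruction term, a formulation like this lets a large unlabelled pool shape the learned representation while a small labelled set drives the classifier, which matches the regularization role the abstract attributes to the auxiliary task.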