Investigating Parameter Sharing in Multilingual Speech Translation
Wang, Qian; Wang, Chen; Zhang, Jiajun
2022
会议日期18-22 September 2022
会议地点Incheon, Korea
英文摘要

End-to-end multilingual speech translation (ST) directly models the mapping from the speech in source languages to the text of multiple target languages. While multilingual neural machine translation has been proved effective in modeling the general knowledge with shared parameters and handling inter-task interference with language-specific parameters, it still lacks exploration of when and where parameter sharing matters in multilingual ST. This work offers such a study by proposing a comprehensive analysis on the influence of various heuristically designed sharing strategies. We further investigate the inter-task interference through gradient similarity between different tasks, and improve the parameter sharing strategy in multilingual ST under the guidance of inter-task gradient similarity. Experimental results on the one-to-many MuST-C dataset have shown that the gradient-guided sharing method can significantly improve the translation quality with a comparable or even lower cost in terms of parameter scale.

语种英语
内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/50601]  
专题模式识别国家重点实验室_自然语言处理
通讯作者Zhang, Jiajun
作者单位1.School of Artificial Intelligence, University of Chinese Academy of Sciences
2.National Laboratory of Pattern Recognition, Institute of Automation, CAS
推荐引用方式
GB/T 7714
Wang, Qian,Wang, Chen,Zhang, Jiajun. Investigating Parameter Sharing in Multilingual Speech Translation[C]. 见:. Incheon, Korea. 18-22 September 2022.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace