×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [7]
计算技术研究所 [1]
内容类型
期刊论文 [5]
会议论文 [3]
发表日期
2021 [8]
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共8条,第1-8条
帮助
限定条件
发表日期:2021
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition
会议论文
Tokyo, Japan, 14-17 December 2021
作者:
Zhengkun Tian
;
Jiangyan Yi
;
Ye Bai
;
Jianhua Tao
;
Shuai Zhang
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2022/06/14
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
会议论文
Brno, Czechia, 30 August – 3 September
作者:
Zhengkun Tian
;
Jiangyan Yi
;
Ye Bai
;
Jianhua Tao
;
Shuai Zhang
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2022/06/14
Decoupling_Pronunciation_and_Language_for_End-to-End_Code-Switching_Automatic_Speech_Recognition
会议论文
Toronto, ON, Canada, 2021-6-11
作者:
Shuai Zhang
收藏
  |  
浏览/下载:4/0
  |  
提交时间:2022/06/17
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:
Fan, Cunhang
;
Yi, Jiangyan
;
Tao, Jianhua
;
Tian, Zhengkun
;
Liu, Bin
收藏
  |  
浏览/下载:27/0
  |  
提交时间:2021/03/08
Speech enhancement
Speech recognition
Training
Noise measurement
Logic gates
Acoustic distortion
Task analysis
Gated recurrent fusion
robust end-to-end speech recognition
speech distortion
speech enhancement
speech transformer
CTNet: Conversational Transformer Network for Emotion Recognition
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:
Lian, Zheng
;
Liu, Bin
;
Tao, Jianhua
收藏
  |  
浏览/下载:33/0
  |  
提交时间:2021/05/06
Emotion recognition
Context modeling
Feature extraction
Fuses
Speech processing
Data models
Bidirectional control
Context-sensitive modeling
conversational transformer network (CTNet)
conversational emotion recognition
multimodal fusion
speaker-sensitive modeling
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 期号: 29, 页码: 1897 - 1911
作者:
Ye Bai
;
Jiangyan Yi
;
Jianhua Tao
;
Zhengkun Tian
;
Zhengqi Wen
收藏
  |  
浏览/下载:23/0
  |  
提交时间:2021/06/25
端到端语音识别、迁移学习、知识蒸馏、老师-学生学习、BERT、非自回归语音识别
Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 2476-2483
作者:
Li, Zekang
;
Li, Zongjia
;
Zhang, Jinchao
;
Feng, Yang
;
Zhou, Jie
收藏
  |  
浏览/下载:29/0
  |  
提交时间:2021/12/01
Task analysis
Feature extraction
Visualization
Speech processing
History
Social networking (online)
Pattern recognition
Dialogue System
Multimodal
Natural Language Processing
Video Understanding
Medical Term and Status Generation From Chinese Clinical Dialogue With Multi-Granularity Transformer
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 3362-3374
作者:
Li, Mei
;
Xiang, Lu
;
Kang, Xiaomian
;
Zhao, Yang
;
Zhou, Yu
收藏
  |  
浏览/下载:26/0
  |  
提交时间:2021/12/28
Medical diagnostic imaging
Transformers
Task analysis
Medical services
Computational modeling
Semantics
Data mining
Medical dialogue
multi-granularity
attention mechanism
natural language understanding
sequence to sequence learning
©版权所有 ©2017 CSpace - Powered by
CSpace