×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [70]
北京大学 [38]
清华大学 [24]
兰州理工大学 [16]
计算技术研究所 [13]
声学研究所 [8]
更多...
内容类型
期刊论文 [133]
会议论文 [46]
其他 [30]
学位论文 [24]
发表日期
2023 [3]
2022 [6]
2021 [14]
2020 [7]
2019 [4]
2018 [6]
更多...
学科主题
Engineerin... [2]
computer s... [1]
计算机科学技术::人... [1]
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共233条,第1-10条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene
期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:
You, Sisi
;
Zuo, Yukun
;
Yao, Hantao
;
Xu, Changsheng
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2023/12/21
Cross-modal audio-visual fusion
incremental learning
person recognition
elastic weight consolidation
feature replay
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art
期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:
Mengting Liu
收藏
  |  
浏览/下载:2/0
  |  
提交时间:2024/01/25
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation
Visually Guided Sound Source Separation With Audio-Visual Predictive Coding
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 15
作者:
Song, Zengjie
;
Zhang, Zhaoxiang
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2023/11/17
Feature fusion
multimodal learning
predictive coding (PC)
self-supervised learning
sound source separation
Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2534-2547
作者:
Li, Xingfeng
;
Shi, Xiaohan
;
Hu, Desheng
;
Li, Yongwei
;
Zhang, Qingchen
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2023/11/17
Affective computing
speech emotion recognition
acoustic representation
music theory and speech analysis
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:
Yi, Jiangyan
;
Tao, Jianhua
;
Fu, Ruibo
;
Wang, Tao
;
Zhang, Chu Yuan
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2023/11/17
Adversarial training
multi-task learning
prosodic boundaries
speech synthesis
multi-modal embeddings
Train from scratch: Single-stage joint training of speech separation and recognition
期刊论文
COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15
作者:
Shi, Jing
;
Chang, Xuankai
;
Watanabe, Shinji
;
Xu, Bo
收藏
  |  
浏览/下载:33/0
  |  
提交时间:2022/07/25
Cocktail party problem
Speech separation
Multi-speaker speech recognition
End-to-end
Joint-training
A retrieval method for encrypted speech based on improved power normalized cepstrum coefficients and perceptual hashing
期刊论文
Multimedia Tools and Applications, 2022, 卷号: 81, 期号: 11, 页码: 15127-15151
作者:
Zhang, Qiu-yu
;
Bai, Jian
;
Xu, Fu-jiu
收藏
  |  
浏览/下载:19/0
  |  
提交时间:2022/06/20
Authentication
Chaotic systems
Discrete wavelet transforms
Efficiency
Extraction
Hamming distance
Hash functions
Information retrieval
Principal component analysis
Speech
Cepstrum
Chaotic mapping
Encrypted speech
Encrypted speech retrieval
Features extraction
Henon chaotic mapping
Perceptual hashing
Power
Power normalized cepstrum coefficient
Speech feature extraction
Speech features
Speech retrieval
Probability Enhanced Entropy (PEE) Novel Feature for Improved Bird Sound Classification
期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 1, 页码: 52-62
作者:
Ramashini Murugaiya
收藏
  |  
浏览/下载:186/0
  |  
提交时间:2022/01/25
Bird sounds
classification
Gammatone frequency cepstral coefficient (GTCC)
probability enhanced entropy (PEE)
support vector machine (SVM)
Research on Video Captioning Based on Multifeature Fusion
期刊论文
Computational Intelligence and Neuroscience, 2022, 卷号: 2022
作者:
Zhao, Hong
;
Guo, Lan
;
Chen, ZhiWen
;
Zheng, HouZe
收藏
  |  
浏览/下载:24/0
  |  
提交时间:2022/06/20
Embedded layers
Frame features
Incomplete information
Large-scale datasets
Mode features
Multi-feature fusion
Performance
Pre-training
Single mode
Video frame
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching
期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:
Zheng, Aihua
;
Hu, Menglan
;
Jiang, Bo
;
Huang, Yan
;
Yan, Yan
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2022/03/17
Visualization
Task analysis
Measurement
Speech recognition
Videos
Location awareness
Image recognition
Adversarial learning
audio-visual matching
cross-modal learning
metric learning
©版权所有 ©2017 CSpace - Powered by
CSpace