Real-time continuous detection and recognition of dynamic hand gestures in untrimmed sequences based on end-to-end architecture with 3D DenseNet and LSTM
Lu, Zhi4; Qin, Shiyin2,3; Lv, Pin4; Sun, Liguo4; Tang, Bo1
刊名MULTIMEDIA TOOLS AND APPLICATIONS
2023-07-14
页码38
关键词Continuous detection Gesture recognition Long short-term memory 3D densely connected convolutional networks Connectionist temporal classification Canonical correlation analysis
ISSN号1380-7501
DOI10.1007/s11042-023-16130-1
通讯作者Lu, Zhi(zhi.lu@ia.ac.cn)
英文摘要With the continuous development of the deep learning theory, novel gesture recognition approaches have been constantly emerging, and the performance has also been continuously improved. However, most research methods focus on the recognition of isolated gestures, and the detection and recognition of continuous gestures are rarely studied. To this end, aiming at the real-time detection and classification of dynamic gestures in untrimmed sequences, a well-designed end-to-end architecture based on the variants of 3D DenseNet and unidirectional LSTM was hereby proposed as an effective tool to extract the discriminative spatio-temporal features of untrimmed hand gesture sequences. Then, connectionist temporal classification was combined to train the network on a publicly available dataset, and some effective capacities could be transferred to enhance the learning ability of the proposed network by training large gesture samples. In this way, the class-conditional probability of an incoming sequence belonging to a given gesture class was predicted and then compared with a predefined threshold to automatically determine the start and end of gestures. In addition, to enhance the classification accuracy of segmented gestures, a bidirectional LSTM network was utilized to model the temporal information, with both the past frames and the future ones taken into account. Finally, a continuous gesture dataset collected indoors for specific application was introduced to validate the proposed method. On this challenge dataset, the 3D DenseNet-LSTM model achieves real-time early detection and classification tasks on unsegmented gesture sequences, and the 3D DenseNet-BiLSTM not only achieves an accuracy of 92.06% on segmented gestures, but also a classification accuracy of 89.8% and 99.7% on nvGesture and SKIG public datasets, respectively. The experimental results demonstrate the performance advantages of the detection and classification as well as the real-time response speed.
资助项目National Natural Science Foundation of China[61731001] ; Natural Science Foundation of Zhejiang Province[LY21E050017]
WOS关键词FUSION ; ROBUST
WOS研究方向Computer Science ; Engineering
语种英语
出版者SPRINGER
WOS记录号WOS:001030536100003
资助机构National Natural Science Foundation of China ; Natural Science Foundation of Zhejiang Province
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/53726]  
专题复杂系统认知与决策实验室
通讯作者Lu, Zhi
作者单位1.China Jiliang Univ, Coll Metrol & Measurement Engn, Hangzhou 310018, Zhejiang, Peoples R China
2.Beihang Univ, Sch Automat Sci & Elect Engn, 37 Xueyuan Rd, Beijing 100191, Peoples R China
3.Dongguan Univ Technol, Sch Elect Engn & Intelligentizat, Dongguan 523808, Peoples R China
4.Chinese Acad Sci, Inst Automat, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Lu, Zhi,Qin, Shiyin,Lv, Pin,et al. Real-time continuous detection and recognition of dynamic hand gestures in untrimmed sequences based on end-to-end architecture with 3D DenseNet and LSTM[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2023:38.
APA Lu, Zhi,Qin, Shiyin,Lv, Pin,Sun, Liguo,&Tang, Bo.(2023).Real-time continuous detection and recognition of dynamic hand gestures in untrimmed sequences based on end-to-end architecture with 3D DenseNet and LSTM.MULTIMEDIA TOOLS AND APPLICATIONS,38.
MLA Lu, Zhi,et al."Real-time continuous detection and recognition of dynamic hand gestures in untrimmed sequences based on end-to-end architecture with 3D DenseNet and LSTM".MULTIMEDIA TOOLS AND APPLICATIONS (2023):38.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace