Active Temporal Action Detection in Untrimmed Videos via Deep Reinforcement Learning
Li, Nan-Nan; Guo, Hui-Wen; Zhao, Yang; Li, Thomas; Li, Ge
刊名IEEE ACCESS
2018
文献子类期刊论文
英文摘要Existing action detection algorithms usually generate action proposals through an extensive search over the video at multiple temporal scales, which brings about huge computational overhead and deviates from the human perception procedure. We argue that the process of detecting actions in the video should be naturally one of observation and refinement: observe the current temporal window and refine the span of attended window to cover true action regions. In this paper, we propose an active action detection model that learns to search actions through continuously adjusting the bounds of temporal attended window in a self-adaptive way. The whole process can be deemed as an exploring procedure, where an agent is first placed at the beginning of the video and then traverses the whole video by adopting a sequence of transformations on the current attended window to discover actions according to a learned policy. We utilize reinforcement learning, especially the deep Q-learning algorithm to learn the agent's decision policy. Actually, we construct an end-to-end trainable framework for the action detection task, which includes a proposal generation network based on deep Q-learning, and the classification and regression networks responsible for the action category prediction and the action location adjustment, respectively. In addition, we design a long short-term memory structure upon extracted convolutional neural network features of sparsely sampled frames to generate the effective feature representations for video sequences of various durations. We evaluate the action proposal performance of our approach on THUMOS'14 and assess the generalization ability for unseen action categories on ActivityNet. We also compare the action detection performance of ours with other state-of-the-art methods on both data sets. Experiment results validate the effectiveness of the proposed approach, which can achieve comparative or superior performance than other action detection methods via much fewer proposals.
URL标识查看原文
语种英语
内容类型期刊论文
源URL[http://ir.siat.ac.cn:8080/handle/172644/13678]  
专题深圳先进技术研究院_集成所
推荐引用方式
GB/T 7714
Li, Nan-Nan,Guo, Hui-Wen,Zhao, Yang,et al. Active Temporal Action Detection in Untrimmed Videos via Deep Reinforcement Learning[J]. IEEE ACCESS,2018.
APA Li, Nan-Nan,Guo, Hui-Wen,Zhao, Yang,Li, Thomas,&Li, Ge.(2018).Active Temporal Action Detection in Untrimmed Videos via Deep Reinforcement Learning.IEEE ACCESS.
MLA Li, Nan-Nan,et al."Active Temporal Action Detection in Untrimmed Videos via Deep Reinforcement Learning".IEEE ACCESS (2018).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace