Robust person head detection based on multi-scale representation fusion of deep convolution neural network
Wang Yingying1,2; Yin Yingjie1; Wu Wenqi1,2; Sun Siyang1,2; Wang Xingang1
2017
会议日期2017.12.5-2017.12.8
会议地点澳门
关键词Person Head Detection Deep Convolution Neural Network Multi-scale Representations Multi-task Learning
页码296-301
英文摘要
Person head detection is still a challenge due to the large variability in heads’ sizes and orientations, lighting conditions and strong occlusions. Small heads require local information contained in low level layers instead of semantic features of upper layers. But most of these fine details are lost in the early convolutional layers of the deep convolution neural networks (DCNN). In order to improve the overall detection accuracy, it is important to utilize local information from lower layers into the detection framework. In this letter, we use multi-scale representation fusion of DCNN as a way to incorporate lower layers with upper layers for detection. Our proposed model is based on the recent object detection network Single Shot MultiBox Detector (SSD). VGG16 is used as the base network. Batch normalization (BN) layers are used in our proposed multi-task learning method to accelerate training process and improve the robustness. Compared to state-of-the-art methods, our proposed detector achieves superior person head detection performance on the HollywoodHeads dataset (81.0 AP) and Casablance dataset (78.5 AP).
内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/19753]  
专题精密感知与控制研究中心_精密感知与控制
作者单位1.自动化研究所精密感知与控制研究中心
2.中国科学院大学
推荐引用方式
GB/T 7714
Wang Yingying,Yin Yingjie,Wu Wenqi,et al. Robust person head detection based on multi-scale representation fusion of deep convolution neural network[C]. 见:. 澳门. 2017.12.5-2017.12.8.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace