Robust person head detection based on multi-scale representation fusion of deep convolution neural network | |
Wang Yingying1,2![]() ![]() ![]() ![]() ![]() | |
2017 | |
会议日期 | 2017.12.5-2017.12.8 |
会议地点 | 澳门 |
关键词 | Person Head Detection Deep Convolution Neural Network Multi-scale Representations Multi-task Learning |
页码 | 296-301 |
英文摘要 |
Person head detection is still a challenge due to the large variability in heads’ sizes and orientations, lighting conditions and strong occlusions. Small heads require local information contained in low level layers instead of semantic features of upper layers. But most of these fine details are lost in the early convolutional layers of the deep convolution neural networks (DCNN). In order to improve the overall detection accuracy, it is important to utilize local information from lower layers into the detection framework. In this letter, we use multi-scale representation fusion of DCNN as a way to incorporate lower layers with upper layers for detection. Our proposed model is based on the recent object detection network Single Shot MultiBox Detector (SSD). VGG16 is used as the base network. Batch normalization (BN) layers are used in our proposed multi-task learning method to accelerate training process and improve the robustness. Compared to state-of-the-art methods, our proposed detector achieves superior person head detection performance on the HollywoodHeads dataset (81.0 AP) and Casablance dataset (78.5 AP). |
内容类型 | 会议论文 |
源URL | [http://ir.ia.ac.cn/handle/173211/19753] ![]() |
专题 | 精密感知与控制研究中心_精密感知与控制 |
作者单位 | 1.自动化研究所精密感知与控制研究中心 2.中国科学院大学 |
推荐引用方式 GB/T 7714 | Wang Yingying,Yin Yingjie,Wu Wenqi,et al. Robust person head detection based on multi-scale representation fusion of deep convolution neural network[C]. 见:. 澳门. 2017.12.5-2017.12.8. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论