Adaptive Dilated Convolution For Human Pose Estimation
Zhengxiong Luo1,2,3,4; Zhicheng Wang4; Yan Huang1,2; Liang Wang1,2; Tieniu Tan2; Erin Zhou4
2022-08
会议日期2022-8
会议地点Montréal Québec
英文摘要

Most existing human pose estimation (HPE) methods exploit multi-scale information by fusing feature maps of four different spatial sizes, i.e. 1/4, 1/8, 1/16, and 1/32 of the input image. There are two drawbacks of this strategy: 1) feature maps of different spatial sizes may be not well spatially aligned, which potentially hurts the accuracy of keypoint location; 2) these scales are fixed and inflexible, which may restrict the generalization ability over various human sizes. To- wards these issues, we propose an adaptive dilated convolution (ADC). It can generate and fuse multi-scale features of the same spatial sizes by setting different dilation rates for different channels. Specifically, it uses a regression module to adaptively generate dilation rates for different channels. This also enables ADC to adjust the fused scales according to the sizes of test persons, and thus helps ADC to have better generalization ability. ADC can be end-to-end trained and easily plugged into existing methods. Extensive experiments show that ADC can bring consistent improvements to various HPE methods. The source codes will be released for further research.

语种英语
内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/52066]  
专题自动化研究所_智能感知与计算研究中心
通讯作者Yan Huang
作者单位1.Institute of Automation, Chinese Academy of Sciences (CASIA)
2.Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition (NLPR)
3.School of Artificial Intelligence, University of Chinese Academy of Sciences (UCAS)
4.Megvii Inc
推荐引用方式
GB/T 7714
Zhengxiong Luo,Zhicheng Wang,Yan Huang,et al. Adaptive Dilated Convolution For Human Pose Estimation[C]. 见:. Montréal Québec. 2022-8.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace