Blind image quality assessment via learnable attention-based pooling
Gu, Jie1,2; Meng, Gaofeng1; Xiang, Shiming1,2; Pan, Chunhong1
刊名PATTERN RECOGNITION
2019-07-01
卷号91页码:332-344
关键词Image quality assessment Perceptual image quality Visual attention Convolutional neural network Learnable pooling
ISSN号0031-3203
DOI10.1016/j.patcog.2019.02.021
通讯作者Meng, Gaofeng(gfmeng@nlpr.ia.ac.cn)
英文摘要Many recent algorithms based on convolutional neural network (CNN) for blind image quality assessment (BIQA) share a common two-stage structure, i.e., local quality measurement followed by global pooling. In this paper, we mainly focus on the pooling stage and propose an attention-based pooling network (APNet) for BIQA. The core idea is to introduce a learnable pooling that can model human visual attention in a data-driven manner. Specifically, the APNet is built by incorporating an attention module and allows for a joint learning of local quality and local weights. It can automatically learn to assign visual weights while generating quality estimations. Moreover, we further introduce a correlation constraint between the estimated local quality and attention weight in the network to regulate the training. The constraint penalizes the case in which the local quality estimation on a region attracting more attention differs a lot from the overall quality score. Experimental results on benchmark databases demonstrate that our APNet achieves state-of-the-art prediction accuracy. By yielding an attention weight map as by-product, our model gives a better interpretability on the learned pooling. (C) 2019 Elsevier Ltd. All rights reserved.
资助项目National Natural Science Foundation of China[91646207] ; National Natural Science Foundation of China[61773377] ; National Natural Science Foundation of China[61573352] ; Beijing Natural Science Foundation[L172053]
WOS关键词PERCEPTUAL IMAGE ; VISUAL-ATTENTION ; SCORES
WOS研究方向Computer Science ; Engineering
语种英语
出版者ELSEVIER SCI LTD
WOS记录号WOS:000466250400027
资助机构National Natural Science Foundation of China ; Beijing Natural Science Foundation
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/23709]  
专题自动化研究所_模式识别国家重点实验室
通讯作者Meng, Gaofeng
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
推荐引用方式
GB/T 7714
Gu, Jie,Meng, Gaofeng,Xiang, Shiming,et al. Blind image quality assessment via learnable attention-based pooling[J]. PATTERN RECOGNITION,2019,91:332-344.
APA Gu, Jie,Meng, Gaofeng,Xiang, Shiming,&Pan, Chunhong.(2019).Blind image quality assessment via learnable attention-based pooling.PATTERN RECOGNITION,91,332-344.
MLA Gu, Jie,et al."Blind image quality assessment via learnable attention-based pooling".PATTERN RECOGNITION 91(2019):332-344.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace