Context-Driven Image Caption With Global Semantic Relations of the Named Entities
Jing, Yun1,2; Zhiwei Xu3; Guanglai Gao2
刊名IEEE ACCESS
2020
卷号8页码:143584-143594
关键词Semantics Visualization Task analysis Computational modeling Knowledge based systems Correlation Organizations Image caption named entity semantic relation
ISSN号2169-3536
DOI10.1109/ACCESS.2020.3013321
英文摘要Automatic image captioning has achieved a great progress. However, the existing captioning frameworks basically enumerate the objects in the image. The generated captions lack the real-world knowledge about named entities and their relations, such as the relations among famous persons, organizations and buildings. On the contrary, humans interpret images in a specific way by providing real-world knowledge with relations of the aforementioned named entities. To generate human-like captions, we focus on captioning the images of news, which could provide real-world knowledge of the whole story behind the images. Then we propose a novel model that makes captions closer to the human-like description of the image, by leveraging the semantic relevance of the named entities. The named entities are not only extracted from news under the guidance of the image content, but also extended with external knowledge based on the semantic relations. In detail, we propose a sentence correlation analysis algorithm to selectively draw the contextual information from news, and use entity-linking algorithm based on knowledge graph to discover the relations of entities with a global sight. The results of extensive experiments based on real-world dataset which is collected from the news show that our model generates image captions closer to the corresponding real-world captions.
资助项目Natural Science Foundation of China[61650205] ; Natural Science Foundation of China[61540004] ; Natural Science Foundation of Inner Mongolia Autonomous Region[2018MS06003] ; Natural Science Foundation of Inner Mongolia Autonomous Region[2020MS06025] ; Natural Science Foundation of Education Department Inner Mongolian Autonomous Region[NJZY19083]
WOS研究方向Computer Science ; Engineering ; Telecommunications
语种英语
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
WOS记录号WOS:000560332200001
内容类型期刊论文
源URL[http://119.78.100.204/handle/2XEOYT63/15786]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Jing, Yun
作者单位1.Inner Mongolia Univ Technol, Coll Data Sci & Applicat, Hohhot 010080, Peoples R China
2.Inner Mongolia Univ, Dept Comp Sci, Hohhot 010021, Peoples R China
3.Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
推荐引用方式
GB/T 7714
Jing, Yun,Zhiwei Xu,Guanglai Gao. Context-Driven Image Caption With Global Semantic Relations of the Named Entities[J]. IEEE ACCESS,2020,8:143584-143594.
APA Jing, Yun,Zhiwei Xu,&Guanglai Gao.(2020).Context-Driven Image Caption With Global Semantic Relations of the Named Entities.IEEE ACCESS,8,143584-143594.
MLA Jing, Yun,et al."Context-Driven Image Caption With Global Semantic Relations of the Named Entities".IEEE ACCESS 8(2020):143584-143594.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace