CORC  > 北京大学  > 信息科学技术学院
Mathematical Formula Identification in PDF Documents
Lin, Xiaoyan ; Gao, Liangcai ; Tang, Zhi ; Lin, Xiaofan ; Hu, Xuan
2011
关键词mathematical expression recognition formula extraction PDF document
英文摘要Recognizing mathematical expressions in PDF documents is a new and important field in document analysis. It is quite different from extracting mathematical expressions in image-based documents. In this paper, we propose a novel method by combining rule-based and learning-based methods to detect both isolated and embedded mathematical expressions in PDF documents. Moreover, various features of formulas, including geometric layout, character and context content, are used to adapt to a wide range of formula types. Experimental results show satisfactory performance of the proposed method. Furthermore, the method has been successfully incorporated into a commercial software package for large-scale Chinese e-Book production.; http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000343450700280&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=8e1609b174ce4e31116a60747a720701 ; Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; EI; CPCI-S(ISTP); 5
语种英语
DOI标识10.1109/ICDAR.2011.285
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/321246]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Lin, Xiaoyan,Gao, Liangcai,Tang, Zhi,et al. Mathematical Formula Identification in PDF Documents. 2011-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace