CORC  > 北京大学  > 信息科学技术学院
Chinese abbreviation identification using abbreviation-template features and context information
Sun, Xu ; Wang, Houfeng
2006
英文摘要Chinese abbreviations are frequently used without being defined, which has brought much difficulty into NLP. In this study, the definition-independent abbreviation identification problem is proposed and resolved as a classification task in which abbreviation candidates are classified as either, 'abbreviation' or 'non-abbreviation' according to the posterior probability. To meet our aim of identifying new abbreviations from existing ones, our solution is to add generalization capability to the abbreviation lexicon by replacing words with word classes and therefore create abbreviation-templates. By utilizing abbreviation-template features as well as context information, a SVM model is employed as the classifier. The evaluation on a raw Chinese corpus obtains an encouraging performance. Our experiments further demonstrate the improvement after integrating with morphological analysis, substring analysis and person name identification.; Computer Science, Artificial Intelligence; Computer Science, Information Systems; EI; CPCI-S(ISTP); 1
语种英语
DOI标识10.1007/11940098_26
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/293532]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Sun, Xu,Wang, Houfeng. Chinese abbreviation identification using abbreviation-template features and context information. 2006-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace