Specific Textual Information Detection for Chinese Micro-blog | |
Liu, Kesong ; Niu, Yan ; Yang, Jianwu ; Wang, Jiushuo ; Cai, Huihui | |
2016 | |
Keywords | CLASSIFICATION |
Abstract (English) | Long-term specific textual information detection is an interesting research problem. Batch processing methods usually involve periodically retraining a classifier on different training sets to maintain its performance, since the context of specific textual information in the micro-blog space tends to change. Micro-blog data is an abundant source for detecting and analyzing specific textual information. As a universal concept, specific information can be information about any entity, such as movies, journeys and so on. If we can collect long-term specific information and analyze it, hidden value in the data may emerge. In this paper, we present an incremental learning method based on SVM to detect long-term specific information efficiently. In addition, topic words about the specific information are extracted for different time periods. To test our ideas, we manually create a labeled data set about weight-loss products from Chinese Sina micro-blog, spanning one year, with the help of a semi-supervised text classifier. Experiments show that our algorithm maintains the detection performance quite well and finds strongly related topic words in different time periods.; CPCI-S(ISTP); 129-134 |
Language | English |
Source | 6th International Conference on Information Technology for Manufacturing Systems (ITMS) |
Content type | Other |
Source URL | [http://ir.pku.edu.cn/handle/20.500.11897/459868] |
Collection | School of Information Science and Technology (信息科学技术学院) |
Recommended citation (GB/T 7714) | Liu, Kesong, Niu, Yan, Yang, Jianwu, et al. Specific Textual Information Detection for Chinese Micro-blog. 2016-01-01. |
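
The abstract describes an incremental SVM that is updated as the micro-blog context drifts, instead of being retrained from scratch on each new training set. The record does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a linear SVM trained with Pegasos-style stochastic gradient descent on the hinge loss, whitespace tokenization as a stand-in for real Chinese word segmentation, and invented toy posts. The names `featurize`, `IncrementalLinearSVM`, and `partial_fit` are hypothetical.

```python
from collections import defaultdict


def featurize(text):
    """Bag-of-words counts over whitespace tokens.

    Assumption: real Chinese micro-blog text would need a word segmenter;
    whitespace splitting keeps this sketch dependency-free.
    """
    feats = defaultdict(float)
    for token in text.split():
        feats[token] += 1.0
    return dict(feats)


class IncrementalLinearSVM:
    """Linear SVM updated batch by batch with Pegasos-style SGD (hinge loss)."""

    def __init__(self, lam=0.01):
        self.lam = lam                 # L2 regularization strength
        self.w = defaultdict(float)    # sparse weight vector
        self.t = 0                     # number of SGD steps taken so far

    def score(self, feats):
        """Signed distance-like score: dot product of weights and features."""
        return sum(self.w.get(k, 0.0) * v for k, v in feats.items())

    def partial_fit(self, batch, epochs=20):
        """Update the existing weights with a new batch of (text, label) pairs.

        Labels are +1 (specific information) or -1 (other posts).
        Earlier batches are not revisited; only the weights carry over.
        """
        for _ in range(epochs):
            for text, label in batch:
                feats = featurize(text)
                self.t += 1
                eta = 1.0 / (self.lam * self.t)      # decaying step size
                margin = label * self.score(feats)   # computed before the update
                shrink = 1.0 - eta * self.lam        # regularization shrinkage
                for k in list(self.w):
                    self.w[k] *= shrink
                if margin < 1.0:                     # hinge loss is active
                    for k, v in feats.items():
                        self.w[k] += eta * label * v

    def predict(self, text):
        return 1 if self.score(featurize(text)) > 0 else -1


# Toy data for two time periods (invented for illustration only).
period_1 = [("diet pills lose weight", 1), ("movie tonight with friends", -1)]
period_2 = [("slimming tea burns fat", 1), ("traffic jam this morning", -1)]

model = IncrementalLinearSVM()
model.partial_fit(period_1)
before = model.predict("slimming tea")   # vocabulary not seen yet
model.partial_fit(period_2)
after = model.predict("slimming tea")    # vocabulary learned from period 2
print(before, after, model.predict("lose weight fast"))
```

In this toy run the second batch teaches the model vocabulary that did not appear in the first period, while the weights learned earlier are kept. That is the property an incremental method relies on when the wording around a topic changes over a year of posts.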