CORC  > 兰州理工大学  > 兰州理工大学  > 计算机与通信学院
Fuzzy clustering based on semantic body and its application in Chinese spam filtering
Zhang, Qiu-yu1,2; Yang, Hui-juan1; Wang, Peng1; Ma, Wei1
刊名International Journal of Digital Content Technology and its Applications
2011-04-01
卷号5期号:4页码:1-11
关键词Cluster analysis Electronic mail Fuzzy clustering Fuzzy systems Chinese spam Equivalence relations Hownet Semantic bodies Similarity analysis
ISSN号19759339
DOI10.4156/jdcta.vol5.issue4.22
英文摘要E-mail's text is the main body of an E-mail. Its content is reflected by semantic body formed by a large number of semantic elements, so it is the most authoritative and effective to study semantic body information of spam when analyzing its text. Firstly, this paper takes the advantage of HowNet in analysis of semantic element and analyze semantic bodies in email text, then proposes the method of constructing semantic body and calculation ways of similarity between semantic bodies based on sentence similarity. Secondly, for the problem of Imprecision and Fuzziness existing in current spam filtering technology, we use fuzzy clustering method to solve it. Combining fuzzy clustering with the semantic body, the paper proposes the method of fuzzy clustering based on semantic body. It is different from the traditional methods that semantic body is used as the object to be classified and the similarity between semantic bodies used as similarity coefficient in the proposed method. The method reduces the dimension when we use fuzzy clustering method to deal with text clustering problem. Finally, we apply the new method of fuzzy clustering based on semantic body to spam filtering. The result of the experiment shows that this method is more objective in determining email content when comparing with the method of traditional email filtering in semantic unit. The proposed method reflects much better in recall rate of discernment of email for spam whose meaning is expressed unclearly.
语种英语
出版者Advanced Institute of Convergence Information Technology
内容类型期刊论文
源URL[http://ir.lut.edu.cn/handle/2XXMBERH/111607]  
专题计算机与通信学院
作者单位1.School of Computer and Communication, Lanzhou University of Technology, Lanzhou Gansu 730050, China;
2.Key Laboratory of Gansu Advanced Control for Industrial Processes, Lanzhou Gansu 730050, China
推荐引用方式
GB/T 7714
Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,et al. Fuzzy clustering based on semantic body and its application in Chinese spam filtering[J]. International Journal of Digital Content Technology and its Applications,2011,5(4):1-11.
APA Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,&Ma, Wei.(2011).Fuzzy clustering based on semantic body and its application in Chinese spam filtering.International Journal of Digital Content Technology and its Applications,5(4),1-11.
MLA Zhang, Qiu-yu,et al."Fuzzy clustering based on semantic body and its application in Chinese spam filtering".International Journal of Digital Content Technology and its Applications 5.4(2011):1-11.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace