Fuzzy clustering based on semantic body and its application in Chinese spam filtering | |
Zhang, Qiu-yu1,2; Yang, Hui-juan1; Wang, Peng1; Ma, Wei1 | |
刊名 | International Journal of Digital Content Technology and its Applications
![]() |
2011-04-01 | |
卷号 | 5期号:4页码:1-11 |
关键词 | Cluster analysis Electronic mail Fuzzy clustering Fuzzy systems Chinese spam Equivalence relations Hownet Semantic bodies Similarity analysis |
ISSN号 | 19759339 |
DOI | 10.4156/jdcta.vol5.issue4.22 |
英文摘要 | E-mail's text is the main body of an E-mail. Its content is reflected by semantic body formed by a large number of semantic elements, so it is the most authoritative and effective to study semantic body information of spam when analyzing its text. Firstly, this paper takes the advantage of HowNet in analysis of semantic element and analyze semantic bodies in email text, then proposes the method of constructing semantic body and calculation ways of similarity between semantic bodies based on sentence similarity. Secondly, for the problem of Imprecision and Fuzziness existing in current spam filtering technology, we use fuzzy clustering method to solve it. Combining fuzzy clustering with the semantic body, the paper proposes the method of fuzzy clustering based on semantic body. It is different from the traditional methods that semantic body is used as the object to be classified and the similarity between semantic bodies used as similarity coefficient in the proposed method. The method reduces the dimension when we use fuzzy clustering method to deal with text clustering problem. Finally, we apply the new method of fuzzy clustering based on semantic body to spam filtering. The result of the experiment shows that this method is more objective in determining email content when comparing with the method of traditional email filtering in semantic unit. The proposed method reflects much better in recall rate of discernment of email for spam whose meaning is expressed unclearly. |
语种 | 英语 |
出版者 | Advanced Institute of Convergence Information Technology |
内容类型 | 期刊论文 |
源URL | [http://ir.lut.edu.cn/handle/2XXMBERH/111607] ![]() |
专题 | 计算机与通信学院 |
作者单位 | 1.School of Computer and Communication, Lanzhou University of Technology, Lanzhou Gansu 730050, China; 2.Key Laboratory of Gansu Advanced Control for Industrial Processes, Lanzhou Gansu 730050, China |
推荐引用方式 GB/T 7714 | Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,et al. Fuzzy clustering based on semantic body and its application in Chinese spam filtering[J]. International Journal of Digital Content Technology and its Applications,2011,5(4):1-11. |
APA | Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,&Ma, Wei.(2011).Fuzzy clustering based on semantic body and its application in Chinese spam filtering.International Journal of Digital Content Technology and its Applications,5(4),1-11. |
MLA | Zhang, Qiu-yu,et al."Fuzzy clustering based on semantic body and its application in Chinese spam filtering".International Journal of Digital Content Technology and its Applications 5.4(2011):1-11. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论