首页 | 本学科首页   官方微博 | 高级检索  
     检索      

网络博客空间中基于半监督学习的垃圾评论检测
引用本文:郭利强.网络博客空间中基于半监督学习的垃圾评论检测[J].图书情报工作,2012,56(4):52-55.
作者姓名:郭利强
作者单位:洛阳师范学院教育科学学院
摘    要:针对网络博客空间中垃圾评论泛滥的问题,给出一种半监督学习式网络垃圾评论检测方案。基于评论内容的统计分析,设计相关度、词组重复率、超链接数目、内容淫秽度、句子长度共5个特征指标,给出网络垃圾评论检测系统的框架,并进行实验验证。实验结果表明,本方法能有效检测出网络博客空间中的垃圾评论,具有较好的应用价值。

关 键 词:半监督学习  检测技术  网络博客空间  垃圾评论  
收稿时间:2011-07-28
修稿时间:2011-09-14

Detecting Techniques Based on Semi-supervised Learning for Comment Spam in Internet Blogsphere
Guo Liqiang.Detecting Techniques Based on Semi-supervised Learning for Comment Spam in Internet Blogsphere[J].Library and Information Service,2012,56(4):52-55.
Authors:Guo Liqiang
Institution:Education Science College,Luoyang Normal University,
Abstract:To identify comment spam which has flooded in the blogsphere,a detecting solution for comment spam was proposed by using immune optimization.The key features of comment including content-related validity,phrase recurrence rate,number of hyperlink,obscenity,and length of sentence was expound,a frame of immune optimization algorithm was given,the framework of detecting system for comment spam were designed,and simulation experiments were done to validate our detecting solution.Experimental result shows that detecting solution given in this paper can identify comment spam availably,and has the advantage of good application value.
Keywords:semi-supervised learning detecting techniques internet blogsphere comment spare
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号