首页 | 本学科首页   官方微博 | 高级检索  
     检索      

结合中文分词的贝叶斯文本分类
引用本文:WEI Xiao-ning,朱巧明,LIANG Xing-yan.结合中文分词的贝叶斯文本分类[J].苏州市职业大学学报,2008,19(1):104-107.
作者姓名:WEI Xiao-ning  朱巧明  LIANG Xing-yan
作者单位:苏州大学,江苏,苏州,215021
摘    要:文本分类是组织大规模文档数据的基础和核心。朴素贝叶斯文本分类方法是种简单且有效的文本分类算法,但是属性间强独立性的假设在现实中并不成立,借鉴概率论中的多项式模型,结合中文分词过程,引入特征词条权重,给出了改进Bayes方法。并由实验验证和应用本方法,文本分类的效率得到了提高。

关 键 词:文本分类  贝叶斯  分词

Using Bayesian in Text Classification with Participle-method
WEI Xiao-ning,ZHU Qiao-ming,LIANG Xing-yan.Using Bayesian in Text Classification with Participle-method[J].Journal of Suzhou Vocational University,2008,19(1):104-107.
Authors:WEI Xiao-ning  ZHU Qiao-ming  LIANG Xing-yan
Institution:WEI Xiao-ning, ZHU Qiao-ming, LIANG Xing-yan (1.Suzhou University, Suzhou 215006, China; 2.Nantong University, Nantong 226007, China)
Abstract:Text classification is the base and core of processing large amount of document data. Native Bayes text classifier is a simple and effective text classification method. Text classification is the key technology in organizing and processing large amount of document data. The practical Bayes algorithm is an useful technique which has an assumption of strong independence of different properties. Based on the polynomial model, a way in feature abstraction considering word-weight and participle-method is introduced. At last the experiments show that efficiency of text classification is improved.
Keywords:text classification  Bayes  participle
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号