结合中文分词的贝叶斯文本分类 Using Bayesian in Text Classification with Participle-method期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

结合中文分词的贝叶斯文本分类

引用本文：	WEI Xiao-ning,朱巧明,LIANG Xing-yan.结合中文分词的贝叶斯文本分类[J].苏州市职业大学学报,2008,19(1):104-107.

作者姓名：	WEI Xiao-ning 朱巧明 LIANG Xing-yan

作者单位：	苏州大学,江苏,苏州,215021

摘要：	文本分类是组织大规模文档数据的基础和核心。朴素贝叶斯文本分类方法是种简单且有效的文本分类算法，但是属性间强独立性的假设在现实中并不成立，借鉴概率论中的多项式模型，结合中文分词过程，引入特征词条权重，给出了改进Bayes方法。并由实验验证和应用本方法，文本分类的效率得到了提高。
关键词：	文本分类贝叶斯分词
Using Bayesian in Text Classification with Participle-method

WEI Xiao-ning,ZHU Qiao-ming,LIANG Xing-yan.Using Bayesian in Text Classification with Participle-method[J].Journal of Suzhou Vocational University,2008,19(1):104-107.

Authors:	WEI Xiao-ning ZHU Qiao-ming LIANG Xing-yan

Institution:	WEI Xiao-ning, ZHU Qiao-ming, LIANG Xing-yan (1.Suzhou University, Suzhou 215006, China; 2.Nantong University, Nantong 226007, China)

Abstract:	Text classification is the base and core of processing large amount of document data. Native Bayes text classifier is a simple and effective text classification method. Text classification is the key technology in organizing and processing large amount of document data. The practical Bayes algorithm is an useful technique which has an assumption of strong independence of different properties. Based on the polynomial model, a way in feature abstraction considering word-weight and participle-method is introduced. At last the experiments show that efficiency of text classification is improved.

Keywords:	text classification Bayes participle
本文献已被维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏