改进的朴素贝叶斯聚类Web文本分类挖掘技术 The Improved Naive Bayes Text Classification Data Mining Clustering Web期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

改进的朴素贝叶斯聚类Web文本分类挖掘技术

作者姓名：	高胜利

作者单位：	江苏财经职业技术学院,江苏淮安,223001

基金项目：	江苏省工程技术研究开发中心，淮安市创新载体平台建设计划

摘要：	通过对Web数据的特点进行详细的分析,在基于传统的贝叶斯聚类算法基础上,采用网页标记形式来有效地弥补朴素贝叶斯算法的不足,并将改进的方法应用在文本分类中,是一种很好的改进思路。最后实验结果也表明,此方法能够有效地对文本进行分类。
关键词：	Web挖掘朴素贝叶斯文本分类
The Improved Naive Bayes Text Classification Data Mining Clustering Web

Authors:	GAO Sheng-li

Institution:	GAO Sheng-li

Abstract:	This paper first introduced the Web mining and text classification of basic theory,specific to the Web data characteristics are analyzed in detail,mainly based on the traditional Bayesian clustering algorithm based on the proposed algorithm,the improvement of the webpage,marked form to effectively compensates for the naive Bayes algorithm is insufficient,will be improved method and its application in text classification,finally the experimental results show that the method can effectively classify the text.

Keywords:	Web mining naive Bayes text classification
本文献已被 CNKI 万方数据等数据库收录！