首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
如今文本自动分类技术发展已较为成熟,中文网页的分类也是自动分类技术的应用之一.分类精度依赖于分类算法,贝叶斯算法在网页分类中有很广泛的使用,但它需要大量且已标记的训练集,而获得大量带有类别标注的样本代价很高.本文以中文网页信息增量式的学习作为研究对象,利用网页已验信息处理训练集增量问题,提出一种改进的增量式的贝叶斯分类算法,研究利用未标记的中文网页来提高分类器的性能,并进行相关实验对比和评价.  相似文献   

2.
常用的网页分类技术大多基于普通文本分类方法,没有充分考虑到网页分类的特殊性--网页本身的半结构化特征以及网页中存在大量干扰分类的噪音信息,同时多数网页分类的测试集和训练集采源于同一个样本集而忽视了测试集中可能包含无类别样本的可能.基于向量空间模型,将样本集看成由有类别样本和无类别样本两部分组成,同时选择了样本集来自于相同的网站,在去除网页噪音基础上结合文本相似度算法和最优截尾法,提出了一种基于不完整数据集的网页分类技术LUD(Learning by Unlabeled Data)来改善分类效果,提高分类精度.实验证明:LUD算法与传统的分类方法相比较而言,不但可以提高已有类别样本的分类精度,更主要的是提供了一种发现新类别样本的方法.  相似文献   

3.
针对教学网页这一特定领域,提出一个基于K近邻算法的教学网页自动分类模型。该模型采用向量空间模型对教学网页的特征进行量化,并采用基于K近邻的分类方法对新的网页进行自动归类。最后通过实验数据说明该算法在教学网页的分类中是有效。  相似文献   

4.
利用构造性学习(CML)算法训练分类器需要大量已标记样本,然而获取大量已标记的样本较为困难.为此,提出了一种人脑半监督的构造性学习算法(HPSS-CML).根据已标记样本,通过覆盖算法构造分类网络,对未标记样本进行有选择的标记,并将其加入训练集,调整分类网络参数.重复进行上述过程,直到没有新标记的样本为止,得到最终的分类器.测试阶段再次利用未标记样本对"拒认状态"的测试样本进行标记.最后选取UCI数据集进行实验,结果表明,与CML算法及Tri-CML算法相比,该方法的分类更为有效.  相似文献   

5.
对于已经分类的数据和大量未分类数据,在运算过程中,采用一种新的半监督聚类算法为支持向量机提供新的训练数据.随后,利用支持向量机判别出所有数据的类别属性,并选取最可靠的点加入已分类集合.为了验证算法的效率,收集了67张黄瓜叶片色调的数字信息,并对具有6个已分类数据与61个未分类数据的数据集进行半监督聚类分析,以判断这些叶片的健康程度.结果表明,该聚类算法优于其他算法.  相似文献   

6.
由于文本表示直接影响文本分类的效果,该文提出了一种有监督局部保持索引的文本表示方法.该方法利用Jaccard系数确定同一类别中文本之间的相似性,找出样本对应在低维空间中的文本表示.采用K近邻分类器在Reuters-21578数据集上进行训练和测试.实验结果表明,有监督保局索引方法在文本表示上更有优势.  相似文献   

7.
结合蚁群算法在解决分类问题方面的优势,以及中文网页内容特征值的离散性特点,提出一种改进的基于蚁群算法的网页分类方法。该算法通过携带类别信息的种群蚂蚁的爬行,在迭代过程中寻找一条最佳路径与之匹配,实现了Web页面的分类。最佳路径通过计算测试文档与每一类别的覆盖集合,进而比较最优覆盖集合得到。其中类别权重计算中引入了文字链接比和标签权值,进一步提高了分类精度。实验证明,引入类别覆盖集的蚁群分类算法能够取得更好的分类效果。  相似文献   

8.
电类实验教学过程中人工评判学生所测数据工作烦琐,影响了教学质量和效率。该文提出了改进的K近邻(K-nearest neighbors,KNN)分类算法,即基于均值漂移、安全间隔和核主成分分析(KPCA)的M-KPCA-KNN(KNN based on margin and KPCA)算法,以判断学生测量数据正确与否和错误原因。首先利用KPCA对高维实验数据进行降维,然后利用均值漂移向量找到不同类别数据的最密集位置,并在不同类别数据的边界设置安全间隔,最后,将与待测样本距离最近的k个数据设置权重,计算每个类别的权重和,权重和最大的类别为待测样本的类别。与现有的KNN算法相比,M-KPCA-KNN算法不仅提高了分类正确率,而且降低了时间复杂度。  相似文献   

9.
基于层次的模糊K均值聚类算法研究   总被引:1,自引:0,他引:1  
通过对K均值聚类算法的研究,本文提出了一种基于层次聚类与模糊聚类思想的K均值聚类算法。算法首先使用层次方法对数据进行初始聚类,然后用得到的聚类数作为模糊K均值聚类中的K值,对聚类进行修正。最后通过实验,验证了该算法不需要人为假设聚类算法中的K值,而且引入了模糊隶属关系使类别的划分更接近于事实,从而证明了该算法的有效性。  相似文献   

10.
互联网加速了物流业发展,地下物流网络节点选址成为新的研究热点。将二分K 均值算法和免疫算法相结合,对物流中转分配节点(一、二级节点)选址进行了研究。首先根据问题的约束条件和优化目标建立物流一、二级节点选址数学模型,然后采用二分K 均值算法和免疫算法求解最佳一、二级节点选址方案。对南京市仙林区110个物流节点分配方案进行实验,结果表明该算法能很好地解决组合优化问题。  相似文献   

11.
《海外英语》2007,(5):44-45
It is worthy of noting that, whilst Crookston Castle witnessed the earlier and happier portion of Mary's variegated life,  相似文献   

12.
一、吃和喝吃苹果 eat an apple, 吃药 take medicine,吃糖 have some sweets,吃饭 have one's meals,吃馆子 dine out,吃惊 be surprised/  相似文献   

13.
《海外英语》2007,(5):10-11
Many college freshmen arrive woefully unprepared to do college work, and as disadvantaged populations continue to grow, the share of the American work force that has made it through college is expected to plummet. Many experts blame that educational failure not just on high schools but also on colleges. School & College, a special report by The Chronicle, looks at efforts to fix the system. What reforms would better prepare students for college? What should schools and colleges be doing differently? How should state and federal officials help?  相似文献   

14.
The communication of people partially is the communication of cultures. Culture has a direct effect on international commercial activities in all aspects. Different conceptions about time, space, equality, law and the like, lead people to deal with things in different ways. So to know cultures of the counterpart is to facil-itate our enterprises so as to have a smooth and successful communication in commercial activity.  相似文献   

15.
《海外英语》2007,(4):36
There are numbers of crossroads on our long and unpredictable life journey where we totally have no idea about which direction to choose. No matter what our decision is, we should not turn back, but face the music and go ahead instead. I am this kind of girl who always does try without regretting, one example is how I dealt with my love.  相似文献   

16.
王菲 《华章》2007,(12):273-273
Migration occurs behind a variety of reasons and has a great effect on the whole world. People may migrate in order to improve their economic situation, or in order to escape civil strife, persecution, and environmental disasters. The impact of migration is complex, bringing both benefits anddisadvantages. This paper briefly talks about the causes of migration, the allocation of benefits, and the ways in which individual countries and the international community deal with this important subject.  相似文献   

17.
裴水妹 《华章》2007,(11):196
Sister Carrie is one of the most controversial characters in American literature.Thought as a "fallen woman" firstly,she was defined as a "new woman" by some critics later. However, by digging into the motivaton behind the whole process of Carrie's "success", the relationship between Carrie and her creator (the author), the social conditions of then American, it can be found that Carrie has never been free-standing on her thought and she has never found her real-sdf even after becoming a famous actress. In a society dominated by mass consumerism Carrie is only an adherent of her own desires. She also is a representative of all those country girls flooded into cities, a symbol and a sacrifice of the urbanization of America in a time countryside was overcome by cities.  相似文献   

18.
19.
1.IntroductionOne-cyclecontrolmethod,whichwasproposedaboutonedecadeago[1],hasbecomeanattractivemethodinspecialfieldssuchaspowerfactorcorrection[2-6],switchingamplifiers[7,8],etc.Themainideaofthiscontrollerisbasedonintegrationofdiodevoltageinone-cycleandforcesittobeexactlyequaltothereferencevalue.Themainadvantageofthiscontrollerisitsrealtimeabilitytorejectthevariationofinputvoltage[1].Despitethisgreatability,ithasnogoodperformancesinrejectingofloaddisturbanceandfollowingreferencecommands.Espec…  相似文献   

20.
风的曲线     
Rosco and I wait for the fishermen to return.I sit at a wooden bench near the store at Mt.Baker Resort and watch the clouds change shape. Rosco has my belt around his neck and an eight foot tow chain hooked to a tree. Dogs must be on a leash. Ducks and rabbits are loose.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号