
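Research on Question Deduplication Techniques for Chinese Question-Answering Communities
Cite this article: 彭月娥, 杨思春, 李心磊, 丁菲菲, 向恒月. 面向中文问答社区的问题去重技术研究[J]. 铁道师院学报, 2014(1): 76-80.
Authors: PENG Yuee (彭月娥), YANG Sichun (杨思春), LI Xinlei (李心磊), DING Feifei (丁菲菲), XIANG Hengyue (向恒月)
Affiliation: School of Computer Science and Technology, Anhui University of Technology, Ma'anshan, Anhui 243002
Funding: Open Project of the State Key Laboratory for Novel Software Technology (KFKT2010B02); Key Natural Science Research Project of Anhui Provincial Universities (KJ2011A048); Graduate Innovation Research Project of Anhui University of Technology (2012086)
Abstract: Drawing on the semantic knowledge resource HowNet (《知网》), this paper proposes a method for removing duplicate questions in question-answering communities, based on computing the similarity between question sentences. By computing the semantic similarity between the questions in an existing question set, the method identifies and removes highly redundant questions, which makes it more efficient for users to find the information they need and improves the user experience. Experimental results on a real question set from “爱问知识人” (Sina iAsk) show that the method achieves good deduplication performance.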

Keywords: similarity; similarity computation; question-answering community

A study of duplicate removal technology based on Chinese CQA
PENG Yuee, YANG Sichun, LI Xinlei, DING Feifei, XIANG Hengyue. A study of duplicate removal technology based on Chinese CQA[J]. Journal of Suzhou Railway Teachers College (Natural Science Edition), 2014(1): 76-80.
Authors: PENG Yuee, YANG Sichun, LI Xinlei, DING Feifei, XIANG Hengyue
Institution: School of Computer Science and Technology, Anhui University of Technology, Ma'anshan 243032, China
Abstract: Based on the semantic knowledge resource HowNet, a duplicate removal method for questions in CQA is proposed that computes the similarity between question sentences. Questions with a high degree of similarity to others are identified and removed by calculating the semantic similarity between them. This increases the efficiency with which users obtain the information they need and improves the user experience. Experimental results on questions from http://iask.sina.com.cn/ show a good duplicate removal effect.
Keywords: similarity; similarity calculation; community question answering
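
The abstract describes the approach only at a high level: compute pairwise similarity between questions and remove those that are highly redundant. The following is a minimal Python sketch of that general idea, not the authors' implementation. The HowNet-based semantic similarity is not reproduced here; a character-bigram Jaccard overlap stands in for it, and the function names, the greedy keep-first strategy, and the 0.6 threshold are all illustrative assumptions.

```python
from typing import Callable, List, Set


def char_bigrams(text: str) -> Set[str]:
    """Return the set of character bigrams of a question, ignoring whitespace."""
    chars = "".join(text.split())
    if len(chars) < 2:
        return {chars} if chars else set()
    return {chars[i:i + 2] for i in range(len(chars) - 1)}


def jaccard_similarity(a: str, b: str) -> float:
    """Stand-in similarity (assumption): Jaccard overlap of character bigrams.

    This is NOT the HowNet-based semantic similarity used in the paper.
    """
    grams_a, grams_b = char_bigrams(a), char_bigrams(b)
    if not grams_a and not grams_b:
        return 1.0  # two empty questions are treated as identical
    return len(grams_a & grams_b) / len(grams_a | grams_b)


def deduplicate(
    questions: List[str],
    similarity: Callable[[str, str], float] = jaccard_similarity,
    threshold: float = 0.6,  # hypothetical cut-off, not taken from the paper
) -> List[str]:
    """Greedily keep a question only if it is not too similar to any kept one."""
    kept: List[str] = []
    for question in questions:
        if all(similarity(question, k) < threshold for k in kept):
            kept.append(question)
    return kept


if __name__ == "__main__":
    sample = [
        "如何学习Python编程",
        "如何学习Python编程?",
        "北京有哪些好玩的地方",
    ]
    for question in deduplicate(sample):
        print(question)
```

Because `similarity` is passed in as a parameter, a HowNet-based measure could be substituted without changing the deduplication loop. The greedy pass compares each question against every question already kept, so its cost grows quadratically with the size of the question set in the worst case.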
This article is indexed in VIP (维普) and other databases.