首页 | 本学科首页   官方微博 | 高级检索  
     


Mining and modeling linkage information from citation context for improving biomedical literature retrieval
Authors:Xiaoshi Yin  Jimmy Xiangji Huang  Zhoujun Li
Affiliation:1. School of Computer Science and Engineering, Beihang University, Beijing, China;2. School of Information Technology, York University, Toronto, Canada
Abstract:Mining linkage information from the citation graph has been shown to be effective in identifying important literatures. However, the question of how to utilize linkage information from the citation graph to facilitate literature retrieval still remains largely unanswered. In this paper, given the context of biomedical literature retrieval, we first conduct a case study in order to find out whether applying PageRank and HITS algorithms directly to the citation graph is the best way of utilizing citation linkage information for improving biomedical literature retrieval. Second, we propose a probabilistic combination framework for integrating citation information into the content-based information retrieval weighting model. Based on the observations of the case study, we present two strategies for modeling the linkage information contained in the citation graph. The proposed framework provides a theoretical support for the combination of content and linkage information. Under this framework, exhaustive parameter tuning can be avoided. Extensive experiments on three TREC Genomics collections demonstrate the advantages and effectiveness of our proposed methods.
Keywords:Probabilistic model   Citation analysis   Ranking   Biomedical information retrieval
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号