Binary Interpolative Coding for Effective Index Compression
Authors: Alistair Moffat, Lang Stuiver
Institution: (1) Department of Computer Science and Software Engineering, The University of Melbourne, 3010, Australia; (2) Department of Computer Science and Software Engineering, The University of Melbourne, 3010, Australia
Abstract: Information retrieval systems hold large volumes of text, with collection sizes now typically in the gigabyte range. Inverted indexes are one important method for providing search facilities over these collections, but unless compressed they require a great deal of space. In this paper we introduce a new method for compressing inverted indexes that yields excellent compression, decodes quickly, and exploits clustering, the tendency for words to appear relatively frequently in some parts of the collection and infrequently in others. We also describe two other quite separate applications for the same compression method: representing the move-to-front (MTF) list positions generated by the Burrows-Wheeler Block Sorting transformation; and transmitting the codebook for semi-static block-based minimum-redundancy coding.
Keywords: index compression; context-based model; document database
This article is indexed in SpringerLink and other databases.
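
The abstract names the method without describing its mechanics, so the following is a minimal sketch of the general idea behind binary interpolative coding, not the authors' implementation. It assumes a strictly increasing list of document numbers within a known range, and it uses plain fixed-width binary codes for simplicity; the function and class names (`encode`, `decode`, `BitReader`) are hypothetical. The middle value is coded first, within bounds tightened by how many values must fit on either side, and the two halves are then coded recursively inside the narrower ranges.

```python
def _width(span):
    """Bits needed for a fixed-width code over `span` possible values."""
    return (span - 1).bit_length()  # span == 1 needs 0 bits


def encode(docs, lo, hi, bits):
    """Append codes for strictly increasing `docs`, all within [lo, hi]."""
    if not docs:
        return
    mid = len(docs) // 2
    left, x, right = docs[:mid], docs[mid], docs[mid + 1:]
    # x must leave room for len(left) smaller and len(right) larger values.
    low, high = lo + len(left), hi - len(right)
    width = _width(high - low + 1)
    if width:
        bits.append(format(x - low, f"0{width}b"))
    encode(left, lo, x - 1, bits)
    encode(right, x + 1, hi, bits)


class BitReader:
    """Reads fixed-width values back from a string of '0'/'1' characters."""

    def __init__(self, bitstring):
        self.bits, self.pos = bitstring, 0

    def read(self, span):
        width = _width(span)
        if width == 0:
            return 0  # only one possible value, nothing was written
        chunk = self.bits[self.pos:self.pos + width]
        self.pos += width
        return int(chunk, 2)


def decode(n, lo, hi, reader):
    """Recover `n` values in [lo, hi], mirroring the order used by encode."""
    if n == 0:
        return []
    n_left = n // 2
    n_right = n - n_left - 1
    low, high = lo + n_left, hi - n_right
    x = low + reader.read(high - low + 1)
    left = decode(n_left, lo, x - 1, reader)
    right = decode(n_right, x + 1, hi, reader)
    return left + [x] + right


# Round trip on an illustrative (made-up) postings list over documents 1..20.
postings = [3, 8, 9, 11, 12, 13, 17]
out = []
encode(postings, 1, 20, out)
stream = "".join(out)
assert decode(len(postings), 1, 20, BitReader(stream)) == postings
```

In this sketch the decoder must be told the list length and the range in advance. When a run of values completely fills its range, every bound collapses to a single candidate and no bits are emitted for that run, which is one way to see why clustered postings compress well under this scheme.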