Performance of query processing implementations in ranking-based text retrieval systems using inverted indices |
| |
Authors: | B. Barla Cambazoglu Cevdet Aykanat |
| |
Affiliation: | Computer Engineering Department, Bilkent University, TR 06800 Bilkent, Ankara, Turkey |
| |
Abstract: | Similarity calculations and document ranking form the computationally expensive parts of query processing in ranking-based text retrieval. In this work, for these calculations, 11 alternative implementation techniques are presented under four different categories, and their asymptotic time and space complexities are investigated. To our knowledge, six of these techniques are not discussed in any other publication before. Furthermore, analytical experiments are carried out on a 30 GB document collection to evaluate the practical performance of different implementations in terms of query processing time and space consumption. Advantages and disadvantages of each technique are illustrated under different querying scenarios, and several experiments that investigate the scalability of the implementations are presented. |
| |
Keywords: | Text retrieval Query processing Inverted index Similarity calculations Document ranking Complexity Scalability |
本文献已被 ScienceDirect 等数据库收录! |
|