Ordered similarity measures taking into account the rank of documents |
| |
Institution: | 2. Department of Biology, East Carolina University, Greenville, NC, United States;1. Mohammed V University in Rabat, Faculty of Sciences, Laboratory of Mathematics, Computing and Applications, Rabat, Morocco;2. Informatics Department, Faculty of Sciences and Technologies of Mohammedia (FSTM), Hassan II University, Casablanca, Morocco |
| |
Abstract: | Indices of similarity are used to quantify the difference between two sets of documents. Usually, they are based on the number of elements that they have in common. Indeed, they are calculated from the results of the intersections or unions of the compared sets. But many studies show that order of presentation of the documents is an important fact to be taken into account, particularly in the case of system's evaluation, which is not the case as far as usual measures are concerned. In this article, we propose a general method for the construction of measures of similarity taking into account the rank of presentation of the document. We will call them Ordered Similarity measures, i.e., measures of OS. Then, we present an experimentation of evaluation used to quantify the filtering impact of a system. This protocol is based on a large scale interrogation of the system and on a comparison of answer sets. We present simultaneously the results of comparisons obtained by a classical measure and by an OS measure. Finally we show how to construct OS measures derived from recall and precision. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|