Intelligent Indexing and Semantic Retrieval of Multimodal Documents |
| |
Authors: | Rohini K. Srihari Zhongfei Zhang Aibing Rao |
| |
Affiliation: | (1) Center for Document Analysis and Recognition (CEDAR), UB Commons, State University of New York at Buffalo, 520 Lee Entrance-Suite 202, Buffalo, NY 14228-2583, USA |
| |
Abstract: | Finding useful information from large multimodal document collections such as the WWW without encountering numerous false positives poses a challenge to multimedia information retrieval systems (MMIR). This research addresses the problem of finding pictures. The fact that images do not appear in isolation, but rather with accompanying, collateral text is exploited. Taken independently, existing techniques for picture retrieval using (i) text-based and (ii) image-based methods have several limitations. This research presents a general model for multimodal information retrieval that addresses the following issues: (i) users' information need, (ii) expressing information need through composite, multimodal queries, and (iii) determining the most appropriate weighted combination of indexing techniques in order to best satisfy information need. A machine learning approach is proposed for the latter. The focus is on improving precision and recall in a MMIR system by optimally combining text and image similarity. Experiments are presented which demonstrate the utility of individual indexing systems in improving overall average precision. |
| |
Keywords: | multimedia information retrieval content-based retrieval image indexing text indexing multimodal query processing |
本文献已被 SpringerLink 等数据库收录! |
|