Intelligent Indexing and Semantic Retrieval of Multimodal Documents期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Intelligent Indexing and Semantic Retrieval of Multimodal Documents

Authors:	Rohini K. Srihari Zhongfei Zhang Aibing Rao

Affiliation:	(1) Center for Document Analysis and Recognition (CEDAR), UB Commons, State University of New York at Buffalo, 520 Lee Entrance-Suite 202, Buffalo, NY 14228-2583, USA

Abstract:	Finding useful information from large multimodal document collections such as the WWW without encountering numerous false positives poses a challenge to multimedia information retrieval systems (MMIR). This research addresses the problem of finding pictures. The fact that images do not appear in isolation, but rather with accompanying, collateral text is exploited. Taken independently, existing techniques for picture retrieval using (i) text-based and (ii) image-based methods have several limitations. This research presents a general model for multimodal information retrieval that addresses the following issues: (i) users' information need, (ii) expressing information need through composite, multimodal queries, and (iii) determining the most appropriate weighted combination of indexing techniques in order to best satisfy information need. A machine learning approach is proposed for the latter. The focus is on improving precision and recall in a MMIR system by optimally combining text and image similarity. Experiments are presented which demonstrate the utility of individual indexing systems in improving overall average precision.

Keywords:	multimedia information retrieval content-based retrieval image indexing text indexing multimodal query processing
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏