首页 | 本学科首页   官方微博 | 高级检索  
     


Intelligent Indexing and Semantic Retrieval of Multimodal Documents
Authors:Rohini K. Srihari  Zhongfei Zhang  Aibing Rao
Affiliation:(1) Center for Document Analysis and Recognition (CEDAR), UB Commons, State University of New York at Buffalo, 520 Lee Entrance-Suite 202, Buffalo, NY 14228-2583, USA
Abstract:Finding useful information from large multimodal document collections such as the WWW without encountering numerous false positives poses a challenge to multimedia information retrieval systems (MMIR). This research addresses the problem of finding pictures. The fact that images do not appear in isolation, but rather with accompanying, collateral text is exploited. Taken independently, existing techniques for picture retrieval using (i) text-based and (ii) image-based methods have several limitations. This research presents a general model for multimodal information retrieval that addresses the following issues: (i) users' information need, (ii) expressing information need through composite, multimodal queries, and (iii) determining the most appropriate weighted combination of indexing techniques in order to best satisfy information need. A machine learning approach is proposed for the latter. The focus is on improving precision and recall in a MMIR system by optimally combining text and image similarity. Experiments are presented which demonstrate the utility of individual indexing systems in improving overall average precision.
Keywords:multimedia information retrieval  content-based retrieval  image indexing  text indexing  multimodal query processing
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号