A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection

Authors:	Benjamin Piwowarski (bpiwowar@dcc.uchile.cl), Patrick Gallinari

Affiliation:	(1) Center for Web Research, DCC, Universidad de Chile, Blanco Encalada 2120, Santiago, Chile; (2) LIP6, 8, rue du capitaine Scott, 75015 Paris, France

Abstract:	Most recent document standards, such as XML, rely on structured representations. Current information retrieval systems, by contrast, were developed for flat document representations and cannot easily be extended to cope with more complex document types. The design of retrieval systems for structured documents is still an open problem. We present a new model for structured document retrieval that computes scores for document parts. The model is based on Bayesian networks whose conditional probabilities are learnt from a labelled collection of structured documents, composed of documents, queries and their associated assessments. Training these models is a complex, non-standard machine learning task and is the focus of the paper: we propose to train the structured Bayesian network model using a cross-entropy training criterion. Results are presented on the INEX corpus of XML documents.

Keywords:	Bayesian networks; structured information retrieval; XML; machine learning for structured retrieval
This article is indexed in SpringerLink and other databases.
