期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Mobile Agents for Distributed and Heterogeneous Information Retrieval 总被引：1，自引：0，他引：1

Subrata?Das Email author Kurt?Shuster Curt?Wu Igor?Levit 《Information Retrieval》2005,8(3):383-416

The heterogeneous, distributed and voluminous nature of many government and corporate data sources impose severe constraints on meeting the diverse requirements of users who analyze the data. Additionally, communication bandwidth limitations, time constraints, and multiple data formats impose further restrictions on users of these distributed data sources. In this paper, we present an Agent-based Complex QUerying and Information Retrieval Engine (ACQUIRE) for large, heterogeneous, and distributed data sources. ACQUIRE acts as a softbot or interface agent by presenting users with a view of a single, unified, homogenous data source, against which users can pose high-level declarative queries. ACQUIRE translates each such user query into a set of sub-queries by employing a combination of planning and traditional database query optimization techniques. ACQUIRE then spawns a set of mobile agents corresponding to these sub-queries, which in turn retrieve the data from various distributed data sources by dynamically optimizing the retrieval strategy as it is carried out. These mobile agents carry with them data-processing code that can be executed at the remote site, thus reducing the size of data returned by the agent. When all mobile agents have returned, ACQUIRE filters and merges the retrieved data and presents the results to the user. While the system is still very much a work in progress, current validation experiments on simulated NASA Distributed Active Archive Centers (DAACs) have demonstrated that complex queries can be effectively decomposed and retrieved by this approach. 相似文献

2.

Partial Collection Replication for Information Retrieval

Zhihong Lu Kathryn S. McKinley 《Information Retrieval》2003,6(2):159-198

The explosion of content in distributed information retrieval (IR) systems requires new mechanisms in order to attain timely and accurate retrieval of unstructured text. This paper shows how to exploit locality by building, using, and searching partial replicas of text collections in a distributed IR system. In this work, a partial replica includes a subset of the documents from larger collection(s) and the corresponding inference network search mechanism. For each query, the distributed system determines if partial replica is a good match and then searches it, or it searches the original collection. We demonstrate the scenarios where partial replication performs better than systems that use caches which only store previous query and answer pairs. We first use logs from THOMAS and Excite to examine query locality using query similarity versus exact match. We show that searching replicas can improve locality (from 3 to 19%) over the exact match required by caching. Replicas increase locality because they satisfy queries which are distinct but return the same or very similar answers. We then present a novel inference network replica selection function. We vary its parameters and compare it to previous collection selection functions, demonstrating a configuration that directs most of the appropriate queries to replicas in a replica hierarchy. We then explore the performance of partial replication in a distributed IR system. We compare it with caching and partitioning. Our validated simulator shows that the increases in locality due to replication make it preferable to caching alone, and that even a small increase of 4% in locality translates into a performance advantage. We also show a hybrid system with caches and replicas that performs better than each on their own. 相似文献

3.

The Probability Ranking Principle Revisited

Martin Wechsler Peter Schäuble 《Information Retrieval》2000,3(3):217-227

A theoretic framework for multimedia information retrieval is introduced which guarantees optimal retrieval effectiveness. In particular, a Ranking Principle for Distributed Multimedia-Documents (RPDM) is described together with an algorithm that satisfies this principle. Finally, the RPDM is shown to be a generalization of the Probability Ranking principle (PRP) which guarantees optimal retrieval effectiveness in the case of text document retrieval. The PRP justifies theoretically the relevance ranking adopted by modern search engines. In contrast to the classical PRP, the new RPDM takes into account transmission and inspection time, and most importantly, aspectual recall rather than simple recall. 相似文献

4.

国际医学会议信息检索简介 总被引：2，自引：0，他引：2

刘海航《新世纪图书馆》2007,(3):44-45

医学会议信息是了解医学领域最新发展状况的一个重要情报源。论文介绍了几个国内外医学会议信息网站以及有效获取医学会议信息的一些技巧。相似文献

5.

高校图书馆的文献采访工作必须掌握动态信息 总被引：7，自引：0，他引：7

徐卫《图书馆论坛》2004,24(4):207-208,211

探讨了高校图书馆文献采访工作应如何掌握动态信息，提高购书质量，加强馆藏建设的原则以及网络环境下文献资源建设的新模式。相似文献

6.

信息检索中修饰语作用的研究

马晖男吴江宁潘东华《情报学报》2006,25(3):306-311

在海量信息中检索时,与用户查询相关的信息常常被漏掉,而与查询无关的信息———信息垃圾,却大量地出现在检索结果中。改进文本信息检索系统的质量,提高检索效能,已成为亟待解决的问题。本文针对能够影响检索效力的一个易被忽略的因素———修饰语,研究其在文本信息检索中的作用。为此,构建了修正的向量空间模型(Modified Vector Space Model,MVSM),并以英文文本进行试验,进而说明修饰语的作用。相似文献

7.

Retrieving with Good Sense

Mark Sanderson 《Information Retrieval》2000,2(1):49-69

Although always present in text, word sense ambiguity only recently became regarded as a problem to information retrieval which was potentially solvable. The growth of interest in word senses resulted from new directions taken in disambiguation research. This paper first outlines this research and surveys the resulting efforts in information retrieval. Although the majority of attempts to improve retrieval effectiveness were unsuccessful, much was learnt from the research. Most notably a notion of under what circumstance disambiguation may prove of use to retrieval. 相似文献

8.

文检课为核心的信息素质教育模型的构建 总被引：6，自引：0，他引：6

刘梦溪《图书馆论坛》2007,27(1):49-50,151

阐述了韶关学院图书馆文检课的发展历程，特别是教学手段建设和教学方法改革方面取得的成效。随着课程体系的完善，新生教育和讲座的开展，学习网站和传统媒体的揉合，文检课为核心的信息素质教育模型业已凸显。相似文献

9.

2006-2007年国外信息检索基本理论研究进展

刘志辉黄国彬《图书馆建设》2008,(3):76-82

2006-2007年国外对信息检索基础理论的研究主要集中于决策理论、隐含语义索引理论研究以及信息检索评价理论研究。关于信息检索基本原理的研究主要集中在信息检索中的分类、信息检索模型、信息检索类型和检索方式等方面。信息检索中的分类的研究重点包括有关分类器的研究;有关特征选择的研究;有关领域相关词的研究。信息检索类型的研究主要包括焦点检索、图像检索、视频检索、合作过滤、机器音译、无线网中网。检索方式的研究主要包括上下文检索、集成检索、问答系统检索以及用户查询处理等问题。相似文献

10.

A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection

Benjamin?Piwowarski Email author Patrick?Gallinari 《Information Retrieval》2005,8(4):655-681

Most recent document standards like XML rely on structured representations. On the other hand, current information retrieval systems have been developed for flat document representations and cannot be easily extended to cope with more complex document types. The design of such systems is still an open problem. We present a new model for structured document retrieval which allows computing scores of document parts. This model is based on Bayesian networks whose conditional probabilities are learnt from a labelled collection of structured documents—which is composed of documents, queries and their associated assessments. Training these models is a complex machine learning task and is not standard. This is the focus of the paper: we propose here to train the structured Bayesian Network model using a cross-entropy training criterion. Results are presented on the INEX corpus of XML documents. 相似文献

11.

宏检索及SDARTS协议探讨

尹红《图书情报工作》2002,46(9):79-83

宏检索的目标是实现跨多个分布式异构数据源的检索,并能对检索结果进行有效整合.SDARTS协议撷取了SDLIP和STARTS两个协议的精华部分,构建了宏检索的基本框架.本文介绍目前宏检索的研究现状,对SDARTS协议的特点及有待完善的地方作-探讨. 相似文献

12.

近几年国内信息检索可视化研究综述 总被引：1，自引：0，他引：1

潘庆超《图书馆学研究》2010,(12)

文章首先简要介绍了国外信息检索可视化的研究现状,然后分析了国内近几年信息检索可视化的研究领域,包括理论探讨、技术开发及实际应用等几个方面。接着指出信息检索可视化发展所面临的问题。最后提出了信息检索可视化的未来展望。相似文献

13.

高校图书馆的文献信息服务 总被引：6，自引：0，他引：6

傅晓《图书馆论坛》2000,20(4):70-72

针对网络化条件下,高校图书馆用户需求的变化,提出了发展我国高校图书馆文献信息服务的若干对策。相似文献

14.

Using Corpus-Based Approaches in a System for Multilingual Information Retrieval

Martin Braschler Peter Schäuble 《Information Retrieval》2000,3(3):273-284

We present a system for multilingual information retrieval that allows users to formulate queries in their preferred language and retrieve relevant information from a collection containing documents in multiple languages. The system is based on a process of document level alignments, where documents of different languages are paired according to their similarity. The resulting mapping allows us to produce a multilingual comparable corpus. Such a corpus has multiple interesting applications. It allows us to build a data structure for query translation in cross-language information retrieval (CLIR). Moreover, we also perform pseudo relevance feedback on the alignments to improve our retrieval results. And finally, multiple retrieval runs can be merged into one unified result list. The resulting system is inexpensive, adaptable to domain-specific collections and new languages and has performed very well at the TREC-7 conference CLIR system comparison. 相似文献

15.

网络环境下高校信息检索课的有效教学 总被引：2，自引：0，他引：2

唐崇忻《晋图学刊》2007,(3):45-47,56

网络的迅速普及以及图书馆数字资源的发展,使得高校学生面对的信息环境与以往相比有了明显的变化.本文探讨当前信息环境下,信息检索课面临的问题和误区,并提出相应的解决办法. 相似文献

16.

文献信息可视化研究 总被引：8，自引：2，他引：8

周宁文燕平刘玮《情报学报》2003,22(4):468-471

本文主要讨论了文本型文献信息可视化方法,就图符标识法、高维空间描述法、群集映射法、主题场景法,自组织算法等进行了具体讨论;并对非文本型文献信息的可视化也进行了初步探讨. 相似文献

17.

数字出版环境下的信息资源采集研究现状与展望

何坚石《江西图书馆学刊》2010,40(3):19-22

对数字出版环境下信息资源采集的研究现状进行了分析,探讨了数字出版环境下信息资源采集的影响、采集原则、采集策略及采集技术等方面的研究趋势。相似文献

18.

试论多媒体信息检索的发展

丁大可李树青《图书馆论坛》2004,24(6):204-206,272

文章通过对文本信息检索语言发展的分析，得出检索语言发展的一般规律，由此对多媒体信息检索语言的发展做出回顾和总结，同时对于多媒体信息检索语言的前景进行了展望。相似文献

19.

公共信息资源管理研究综述 总被引：2，自引：0，他引：2

杨玉麟赵冰谷秀洁《图书与情报》2009,(1)

文章通过文献回顾,概述了国内外公共信息资源概念的界定、原则和分类,以及部分学术研究成果.分析了四种语境下的"公共信息",揭示了造成国内外认识差异的历史因素,并以英国政府网站为例介绍了国外公共信息资源的-种分类和管理方式. 相似文献

20.

复合型Web信息检索系统 总被引：5，自引：0，他引：5

向桂林《情报学报》2003,22(5):545-549

本文首先分析了常见的三种搜索引擎 :基于内容分析的搜索引擎、基于超链分析的搜索引擎、基于反馈分析的搜索引擎的弊端 ,提出了一种能够集三种搜索引擎优点于一身的复合型Web信息检索系统 ,并详细阐述了该系统的实现方法相似文献