Similar literature
 20 similar documents found; search took 78 ms
1.
The profusion of online resources calls for tools and methods to help Internet users find precisely what they are looking for. The quality-controlled gateway CISMeF provides such services for health resources. However, the human cost of maintaining and updating the catalogue is increasingly high. This paper presents the automatic indexing system currently being developed by the CISMeF team, to be used as is for preliminary indexing, or after human review for final indexing. The system architecture, which uses the INTEX platform for MeSH term extraction, is detailed. The results of a first evaluation tend to indicate that the automatic indexing strategy is relevant, as it achieves a precision comparable to that of other existing operational systems. Moreover, the system presented in this paper retrieves keyword/qualifier pairs as opposed to single terms, therefore providing a significantly more precise indexing. Further development and tests will be carried out in order to improve the coverage of the dictionaries and to validate the efficiency of the system in the indexers' everyday work.

2.
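On the standardization of subject indexing   Total citations: 3, self-citations: 0, citations by others: 3
刘鲁红 《情报理论与实践》2004,27(4):367-368,404
Subject indexing is long-term, foundational work. This paper discusses standardizing the indexing process and the indexing methods used in subject indexing, analyzes problems in current indexing practice, and proposes several countermeasures.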

3.
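Single-Chinese-character indexing is one of the main indexing techniques in Chinese full-text retrieval, but it falls short in both index space and retrieval efficiency. This paper introduces unit-word indexing and analyzes experimental data, which show that with unit-word indexing both the space efficiency of the index and the time efficiency of retrieval improve.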
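
The contrast between the two index types can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not describe the paper's data structures or its word segmentation, so the sample documents, the `vocabulary` word list, and the forward-maximum-matching segmenter are hypothetical.

```python
from collections import defaultdict

def build_index(docs, tokenize):
    """Map each token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index

def char_tokens(text):
    # Single-character indexing: every character is its own index term.
    return [ch for ch in text if not ch.isspace()]

def word_tokens(text, vocabulary, max_len=4):
    # Forward maximum matching against a hypothetical word list;
    # characters not covered by the list fall back to single-character tokens.
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocabulary:
                tokens.append(piece)
                i += size
                break
    return tokens

# Hypothetical sample data.
docs = {1: "全文检索技术", 2: "检索效率"}
vocabulary = {"全文", "检索", "技术", "效率"}

char_index = build_index(docs, char_tokens)
word_index = build_index(docs, lambda text: word_tokens(text, vocabulary))
```

With the word index, a query for 检索 is a single lookup in `word_index`; with the character index, the postings for 检 and 索 must be intersected and adjacency still has to be checked, which is one plausible source of the retrieval cost the abstract mentions.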

4.
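Selecting the best automatic web-page indexing scheme and evaluating its indexing performance   Total citations: 2, self-citations: 0, citations by others: 2
仲云云  侯汉清  薛鹏军 《情报科学》2002,20(10):1108-1110
This paper introduces three automatic indexing schemes for web pages. By comparing manual and automatic indexing results for 50 web pages from 中国经济网, it selects the best scheme: weighting different parts of the full page text and applying weighted term-frequency statistics. Finally, the scheme's automatic subject indexing and its classification indexing are each evaluated in terms of human-machine agreement rate.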
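
A position-weighted term-frequency score of the kind described can be sketched as follows. The region names, the weights in `REGION_WEIGHTS`, and the sample page are hypothetical; the abstract does not state which page regions or weight values the authors used.

```python
from collections import Counter

# Hypothetical region weights; the paper's actual values are not given.
REGION_WEIGHTS = {"title": 3.0, "heading": 2.0, "body": 1.0}

def weighted_term_scores(regions):
    """Sum term frequencies per region, scaled by that region's weight."""
    scores = Counter()
    for region, terms in regions.items():
        weight = REGION_WEIGHTS.get(region, 1.0)
        for term, freq in Counter(terms).items():
            scores[term] += weight * freq
    return scores

def top_terms(regions, k=3):
    # Sort by score descending, then alphabetically for a stable order.
    scores = weighted_term_scores(regions)
    return sorted(scores, key=lambda term: (-scores[term], term))[:k]

# Hypothetical, already tokenized page.
page = {
    "title": ["economy", "growth"],
    "heading": ["growth"],
    "body": ["economy", "trade", "trade", "growth", "policy"],
}
```

Calling `top_terms(page, 2)` ranks "growth" ahead of "economy", because occurrences in the title and heading count for more than occurrences in the body.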

5.
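An ontology-based scheme for automatic full-text indexing   Total citations: 5, self-citations: 1, citations by others: 5
王泰森 《情报科学》2003,21(9):950-952
To help improve the precision of full-text retrieval in digital libraries, this paper proposes an ontology-based scheme for automatic full-text indexing. Using ontological methods, the scheme emphasizes the intrinsic conceptual relations between terms. It targets several problems: traditional manual indexing cannot fully summarize the full text; terms lack conceptual links to one another, so the complete subject content of a document is hard to reflect; and polysemy and synonymy lead either to missed documents or to so many returned results that the search loses its value. The scheme also aims to automate subject indexing in digital libraries.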

6.
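[Purpose/Significance] Use text mining to automatically discover more representative subject terms in document content, locate those terms within chapters, and apply visualization to subject indexing, helping readers find latent relations among document topics intuitively and efficiently. [Method/Process] Subject terms are mined from the content level of documents with text mining, and the resulting information is presented with visualization tools. On this basis, the authors attempt to build a visual automatic subject indexing system and verify its indexing performance on multiple topics in the Gesar domain. [Results/Conclusions] The results show that the method achieves content-level visual automatic subject indexing in the Gesar domain, locating chapters, paragraphs, and sentences quickly and accurately. The process of obtaining indexing information is visual and interactive, which can improve user experience and engagement. The system is validated on 《英雄格萨尔》, but the method itself is not domain-specific and can be applied to literature in other fields.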

7.
孟旭阳  白海燕  梁冰  王莉 《情报杂志》2021,40(3):125-131,7
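[Purpose/Significance] In the era of digitized resources, document services are shifting toward knowledge services, and high-quality automatic indexing is the foundation and key to improving knowledge-service capability. To address the low accuracy of current automatic indexing for English scientific and technical literature, the paper proposes a concept selection optimization method based on semantic awareness. [Method/Process] Building on automatic subject indexing with a knowledge organization system, the method uses neural word-vector techniques from natural language processing to represent the semantics of concepts and of English document content, performs semantic awareness and evaluation, and selects concept indexing results at the semantic level. Combining the knowledge organization system with natural language processing compensates for shortcomings at the semantic level, further reducing the influence of irrelevant concepts and improving the accuracy of concept indexing results. [Results/Conclusions] Experiments show that the method has good semantic-awareness performance, effectively reduces irrelevant concepts during concept selection, and greatly improves the relevance of indexing results to the documents, providing a useful reference and support for building knowledge-oriented services over scientific and technical literature and for related research.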
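
One way to picture semantic-level concept selection is a cosine-similarity filter over vector representations. This is only a sketch under stated assumptions, not the authors' method: the vectors, the candidate concepts, and the `threshold` value are hypothetical, and the abstract does not say which similarity measure or cut-off the paper uses.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def select_concepts(doc_vector, concept_vectors, threshold=0.5):
    """Keep candidate concepts whose vector is close enough to the document's."""
    return [name for name, vec in concept_vectors.items()
            if cosine(doc_vector, vec) >= threshold]

# Hypothetical 3-dimensional embeddings; real word vectors are much larger.
doc_vector = [0.9, 0.1, 0.0]
candidates = {
    "neural network": [0.8, 0.2, 0.1],
    "river basin": [0.0, 0.1, 0.9],
}
```

Here `select_concepts(doc_vector, candidates)` keeps "neural network" and drops "river basin", whose vector points in a nearly orthogonal direction to the document vector.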

8.
A discussion of subject indexing patterns in medical literature CD-ROM databases   Total citations: 3, self-citations: 0, citations by others: 3
To support more effective information searching, this essay compares CBMdisc and MEDLINE for similarities and differences in subject indexing. It also puts forward some suggestions on the subject indexing of Chinese medical literature.

9.
10.
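Semantic retrieval overcomes the shortcomings of traditional keyword-matching retrieval and is the direction in which information retrieval is developing. This paper discusses two indexes for implementing semantic retrieval: latent semantic indexing and a modified form of it. It first introduces the basic idea and retrieval process of latent semantic indexing and then, after analyzing its shortcomings, introduces the modified form, the residual iterative transformation.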
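
For orientation, the standard textbook formulation of latent semantic indexing can be written as below. The notation is an assumption, since the abstract gives none: $A$ is the term-document matrix, $k$ the number of retained singular values, $q$ a query vector in term space, and $\hat{d}_j$ the $j$-th row of $V_k$. The modified form discussed in the paper is not reproduced here.

```latex
A \approx A_k = U_k \Sigma_k V_k^{\mathsf{T}},
\qquad
\hat{q} = \Sigma_k^{-1} U_k^{\mathsf{T}} q,
\qquad
\operatorname{sim}(\hat{q}, \hat{d}_j)
  = \frac{\hat{q} \cdot \hat{d}_j}{\lVert \hat{q} \rVert \, \lVert \hat{d}_j \rVert}
```

Documents are ranked by their cosine similarity to the folded-in query $\hat{q}$ in the $k$-dimensional latent space rather than by literal keyword overlap.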

11.
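An ontology-based semantic indexing method   Total citations: 4, self-citations: 0, citations by others: 4
The traditional approach of indexing documents with subject headings and keywords cannot support semantic reasoning and is therefore increasingly ill-suited to the current network environment. Because ontologies offer a well-formed concept hierarchy and support for logical reasoning, they have great potential value in information retrieval. This paper first introduces the basic concepts of ontologies and the components of a domain ontology, then proposes a semantic indexing method based on a domain ontology, which indexes documents at the semantic level using concepts from the ontology and so provides a basis for intelligent reasoning in retrieval.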

12.
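曹锦丹  刘鑫 《情报科学》2000,18(3):253-255
This paper discusses knowledge representation and indexing in literature databases. It attempts to introduce the OAV (object-attribute-value) triple method from knowledge engineering into novelty-search consulting for science and technology projects, in order to address the evaluation of novelty in research topics and in the review of research results.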
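
An object-attribute-value triple is straightforward to represent in code. The sketch below is illustrative only: the `OAV` type, the `novel_triples` helper, and the sample triples are hypothetical, and comparing triples by exact equality is an assumption rather than the procedure described in the paper.

```python
from typing import NamedTuple

class OAV(NamedTuple):
    """One object-attribute-value statement extracted from a document."""
    obj: str
    attribute: str
    value: str

def novel_triples(project, prior_art):
    """Return the project's triples that no prior document already contains."""
    known = set(prior_art)
    return [triple for triple in project if triple not in known]

# Hypothetical triples for a project under review and for existing literature.
project = [
    OAV("catalyst", "material", "nickel"),
    OAV("catalyst", "operating temperature", "200 C"),
]
prior_art = [
    OAV("catalyst", "material", "nickel"),
    OAV("catalyst", "operating temperature", "350 C"),
]
```

In this toy case `novel_triples(project, prior_art)` returns only the operating-temperature triple, which is the kind of difference a novelty search would then examine by hand.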

13.
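钟哲辉 《情报杂志》1994,13(1):32-35
The paper gives a comprehensive account of the whole process of weighted indexing and automatic retrieval for computerized literature databases. It introduces a vector matrix, which the author presents as a new theoretical breakthrough and, in practice, as a major step toward intelligent retrieval.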

14.
This paper deals with Swedish full text retrieval and the problem of morphological variation of query terms in the document database. The effects of combination of indexing strategies with query terms on retrieval effectiveness were studied. Three of five tested combinations involved indexing strategies that used conflation, in the form of normalization. Further, two of these three combinations used indexing strategies that employed compound splitting. Normalization and compound splitting were performed by SWETWOL, a morphological analyzer for the Swedish language. A fourth combination attempted to group related terms by right hand truncation of query terms. The four combinations were compared to each other and to a baseline combination, where no attempt was made to counteract the problem of morphological variation of query terms in the document database. The five combinations were evaluated under six different user scenarios, where each scenario simulated a certain user type. The four alternative combinations outperformed the baseline, for each user scenario. The truncation combination had the best performance under each user scenario. The main conclusion of the paper is that normalization and right hand truncation (performed by a search expert) enhanced retrieval effectiveness in comparison to the baseline. The performance of the three combinations of indexing strategies with query terms based on normalization was not far below the performance of the truncation combination.
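
Right-hand truncation amounts to prefix matching against the index vocabulary. The following sketch assumes a trailing `*` as the truncation mark and uses a small hypothetical term list; it does not reproduce the paper's retrieval system or SWETWOL.

```python
def expand_truncated(query_stem, index_terms):
    """Return every index term that begins with the truncated query stem."""
    stem = query_stem.rstrip("*").lower()
    return sorted(term for term in index_terms if term.lower().startswith(stem))

# Hypothetical index vocabulary: forms of "bil" (car), plus "bild" (picture)
# and "cykel" (bicycle).
index_terms = ["bil", "bilar", "bilen", "bild", "cykel"]
```

`expand_truncated("bil*", index_terms)` matches "bil", "bilar", and "bilen", but also the unrelated "bild", which suggests why the choice of truncation point is left to a search expert.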

15.
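High-dimensional indexing is a key technology in content-based image retrieval. This paper analyzes the state of research on indexing techniques in image retrieval, classifies, compares, and evaluates existing indexing methods, and finally discusses open problems and directions for development.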

16.
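A discussion of subject indexing consistency in CBMdisc   Total citations: 7, self-citations: 0, citations by others: 7
秦东 《现代情报》2006,26(1):95-96
Using the example of a single article that was published in two different journals and indexed inconsistently in CBMdisc, the paper analyzes the causes of inconsistent subject indexing and offers several suggestions for reducing it.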

17.
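An analysis of subject indexing in CNKI   Total citations: 2, self-citations: 0, citations by others: 2
The volume of literature in online databases keeps growing, usage keeps expanding, and user needs are increasingly urgent, so delivering exactly the documents users need has become a major concern. For journal articles, high-quality subject indexing is the precondition and key to doing so. This paper runs subject searches in CNKI for six subject terms from the discipline of information management, evaluates the quality of CNKI's subject indexing by analyzing the search results, examines the causes of the problems found, and proposes improvements.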

18.
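李培 《情报科学》1999,17(6):676-678,690
This paper studies three typical probabilistic indexing models: the binary independence indexing model, the DIA model, and the 2-Poisson model. It analyzes their principles and processing procedures and evaluates their performance.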
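
As a reminder of what the third model assumes, the usual form of the 2-Poisson distribution for the within-document frequency $X$ of a term is given below. The symbols are assumptions, since the abstract defines none: $\pi$ is the proportion of documents in the class for which the term is a good descriptor, and $\lambda_1$ and $\lambda_2$ are the mean frequencies in that class and in the remaining documents.

```latex
P(X = k) = \pi \, \frac{e^{-\lambda_1} \lambda_1^{k}}{k!}
         + (1 - \pi) \, \frac{e^{-\lambda_2} \lambda_2^{k}}{k!},
\qquad k = 0, 1, 2, \ldots
```

A term is treated as a better indexing candidate the more clearly its observed frequencies separate into these two Poisson components.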

19.
The paper argues that a theory explaining the nature of the subject indexing process is needed, since the process consists of a number of steps and elements and also involves a range of interpretations. The paper points out that Peirce's semiotics is a theory through which we may study and understand the nature of interpretation in the subject indexing process. In recent years there has been great interest in semiotics in the library and information science field. The paper discusses some basic issues of semiotics and its use in information science.

20.
A variety of abstract automatic indexing models have been developed in recent times in an effort to produce indexing methods that are both effective and usable in practice. Among these are the term discrimination model and the term precision system. These two indexing systems are briefly described and experimental evidence is cited showing that a combination of both theories produces better retrieval performance than either one alone. Appropriate conclusions are reached concerning viable automatic indexing procedures usable in practice.
