共查询到20条相似文献,搜索用时 78 毫秒
1.
The profusion of online resources calls for tools and methods to help Internet users find precisely what they are looking for. Quality controlled gateway CISMeF provides such services for health resources. However, the human cost of maintaining and updating the catalogue are increasingly high. This paper presents the automatic indexing system currently developed in the CISMeF team to be used as such for preliminary indexing, or after human reviewing for the final indexing. The system architecture, using the INTEX platform for MeSH term extraction is detailed. The results of a first evaluation tend to indicate that the automatic indexing strategy is relevant, as it achieves a precision comparable to that of other existing operational systems. Moreover, the system presented in this paper retrieves keyword/qualifier pairs as opposed to single terms, therefore providing a significantly more precise indexing. Further development and tests will be carried out in order to improve the coverage of the dictionaries, and validate the efficiency of the system in the indexers’ everyday work. 相似文献
2.
3.
单汉字索引是中文全文检索索引技术中一个主要方法,此方法在索引的空问和检索的效率方面都存在不足。本文引入单元词索引,并分析试验数据,表明引入单元词索引后,索引的空间效率和检索的时间效率均有提高。 相似文献
4.
5.
一个基于本体论全文自动标引方案 总被引:5,自引:1,他引:5
本文为支持数字图书馆全文检索精度的提高,提出了一个基于本体论全文自动标引方案。该方案利用本体论的方法,强调词与词之间的内在概念联系,着重解决传统的人工标引不能全面概括全文,而且词与词之间缺乏概念性的连接,很难反映文件主题的全面内容及由于多义词、同义词等的原因造成漏检或检索结果返回信息太多,失去检索意义,达不到理想效果的问题。并为数字图书馆在进行主题标引时实现自动化操作。 相似文献
6.
[目的/意义]基于文本挖掘技术自动发现更具代表性的文献内容主题词,通过定位主题词在章节中的具体位置,并基于可视化技术进行主题标引,帮助读者直观高效发现文献主题间的潜在关系。[方法/过程]基于文本挖掘技术深入文献内容层挖掘主题词,并利用可视化工具直观呈现所获信息,在此基础上尝试构建可视化主题自动标引系统,并在格萨尔领域的多个主题中对该系统的自动标引效果进行验证。[结果/结论]研究结果显示,该标引方法在格萨尔领域实现了文献内容级的可视化主题自动标引,快速精准地定位到章节、段落和句子。标引相关信息获取过程直观可视,并且具有交互性,可提升用户体验和参与度。文章以《英雄格萨尔》为例完成系统验证,但该标引方法技术本身无领域限定,可应用于其他领域的文献。 相似文献
7.
[目的/意义]资源数字化时代文献服务向知识服务方向转变,高质量的文献自动标引是文献知识服务能力提升的基础和关键,针对目前英文科技文献自动标引准确率不高的问题,提出了基于语义感知的概念遴选优化方法。[方法/过程]基于知识组织系统的自动主题标引,采用自然语言处理中的神经网络词向量技术,对概念和英文文献内容语义进行表示并进行语义感知与评估,实现概念标引结果在语义层面的遴选。该方法采用基于知识组织系统与自然语言处理技术相结合的方法,弥补了在语义层面上的不足,从而进一步降低不相关概念的影响,提高概念标引结果的准确率。[结果/结论]实验结果表明,该方法具有较好的语义感知性能,在概念遴选上有效降低了不相关概念,大大提高了标引结果的文献相关性,为科技文献资源知识化服务建设和相关研究提供有价值的参考和支持。 相似文献
8.
医学文献光盘数据库主题标引规律的探讨 总被引:3,自引:0,他引:3
In order to search information more effectively, this essay compares CBMdisc and MEDLINE for their similarities and differences in subject indexing. It also puts forward some suggestions about subject indexing of Chinese medical literatures. 相似文献
9.
10.
语义检索能克服传统的基于关键词匹配检索的缺点,是信息检索的发展趋势。本文主要探讨两种实现语义检索的索引:潜语义索引和其修正形式。首先介绍了潜语义索引的基本思想和检索过程,并在分析潜语义索引的不足的基础上,介绍了其修正形式———残差迭代变换。 相似文献
11.
12.
文献数据库的知识处理与科技项目查新——OAV法在查新检索中的应用探讨 总被引:4,自引:0,他引:4
本文讨论文献数据库中的知识表达、标引问题,试图将知识工程中的OAV三元组法引入科技项目查新咨询工作中以解决科研主题、成果评审中的创新性评价问题。 相似文献
13.
14.
This paper deals with Swedish full text retrieval and the problem of morphological variation of query terms in the document database. The effects of combination of indexing strategies with query terms on retrieval effectiveness were studied. Three of five tested combinations involved indexing strategies that used conflation, in the form of normalization. Further, two of these three combinations used indexing strategies that employed compound splitting. Normalization and compound splitting were performed by SWETWOL, a morphological analyzer for the Swedish language. A fourth combination attempted to group related terms by right hand truncation of query terms. The four combinations were compared to each other and to a baseline combination, where no attempt was made to counteract the problem of morphological variation of query terms in the document database. The five combinations were evaluated under six different user scenarios, where each scenario simulated a certain user type. The four alternative combinations outperformed the baseline, for each user scenario. The truncation combination had the best performance under each user scenario. The main conclusion of the paper is that normalization and right hand truncation (performed by a search expert) enhanced retrieval effectiveness in comparison to the baseline. The performance of the three combinations of indexing strategies with query terms based on normalization was not far below the performance of the truncation combination. 相似文献
15.
16.
CBMdisc主题标引一致性的探讨 总被引:7,自引:0,他引:7
通过一个发表在两种不同期刊上的同一文献在CBMdisc中标引不一致的例子,分析了造成主题标引不一致的原因,并就减少这种主题标引不一致提出了几点建议。 相似文献
17.
CNKI主题标引分析 总被引:2,自引:0,他引:2
现今网络数据库中文献量日益增大,用户使用量日渐膨胀,需求也愈发急切。怎样准确的提供给用户所需文献成为人们非常重视的问题。对于期刊论文来说,主题标引的高质量是准确提供给用户所需文献的前提与关键。本文选定信息管理学科的6个主题词在CNKI中进行主题检索,通过分析检索结果来评价CNKI的主题标引质量,分析原因并提出改进建议。 相似文献
18.
19.
The paper argues that the theory that opens out the nature of subject indexing process is required, if the process consists of a tremor of steps and elements and it is a range of interpretations, too. The paper points out that Peirce' s semiotics is the theory by which we may study and understand the lmture of interpretation in subject indexing pro-cess. In recent years, there is a great interest in semiofcs in library and information science field. The paper discusses some basic issues of semiotics and its use in information science. 相似文献
20.
A variety of abstract automatic indexing models have been developed in recent times in an effort to produce indexing methods that are both effective and usable in practice. Among these are the term discrimination model and the term precision system. These two indexing systems are briefly described and experimental evidence is cited showing that a combination of both theories produces better retrieval performance than either one alone. Appropriate conclusions are reached concerning viable automatic indexing procedures usable in practice. 相似文献