英文科技文献内核识别方法研究 Research on Recognition of Core Content of English Scientific Literature期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

英文科技文献内核识别方法研究

引用本文：	祝清松,冷伏海,王林,韩涛. 英文科技文献内核识别方法研究[J]. 情报理论与实践, 2012, 35(9): 112-116

作者姓名：	祝清松冷伏海王林韩涛

作者单位：	1. 中国科学院国家科学图书馆,北京100190 中国科学院研究生院,北京100049 2. 中国科学院国家科学图书馆,北京,100190

基金项目：	国家自然科学基金项目“科技创新演化分析理论与方法研究”(项目编号:70873123);中国科学院文献情报新增能力项目“面向‘未来科技竞争力’分析方法和工具研究”的成果

摘要：	针对英文科技文献的特征,提出一种规则和统计相结合的关键内容识别方法。该方法首先通过对源文档进行特征标识,将其转换成更易于处理的中间文档;然后利用特征还原、线索词匹配、主题识别和临近分析等,从中间文档抽取代表文本的主要信息,生成目标文档。该方法能够有效地辅助科研人员阅读大量的英文科技文献,提高阅读效率。
关键词：	特征标识线索词匹配主题识别临近分析
Research on Recognition of Core Content of English Scientific Literature

Affiliation:	Zhu Qingsong et al.

Abstract:	Based on the features of the English scientific literatures,this paper proposes a method of combining rules with statistics to recognize key content.The method firstly recognizes the features of the source document and turns it into the intermediary document which can be processed more easily.Then,through features recovery,clue word matching,topic recognition and proximal analysis,the method creates the target document by extracting the main information representing the document from the intermediary document.The method can effectively help the scientific research personnel read lots of English scientific literatures and improve their reading efficiency.

Keywords:	feature recognition clue word matching topic recognition proximal analysis
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏