一种网上图书信息抽取方法 A Method for Book Information Extraction from Web期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

一种网上图书信息抽取方法

引用本文：	李向阳,张亚非.一种网上图书信息抽取方法[J].情报学报,2004,23(6):655-660.

作者姓名：	李向阳张亚非

作者单位：	解放军理工大学通信工程学院,南京,210007

摘要：	提出一种基于竞争分类的网上图书信息抽取方法 ,以信息片段与样本之间的相似度作为竞争力 ,通过信息片段对信息模板槽的竞争来实现信息片段的分类和噪声信息的过滤 ,直接从分类的角度抽取图书信息。相对基于规则的信息抽取方法 ,在用户标记样本较少的情况下 ,竞争分类法更能适应数据项顺序变化较大或有数据项缺失的数据源 ,适用于从不同的图书数据源集成图书信息
关键词：	信息抽取竞争分类特征提取数据集成
修稿时间：	2004年3月17日
A Method for Book Information Extraction from Web

Li Xiangyang and Zhang Yafei.A Method for Book Information Extraction from Web[J].Journal of the China Society for Scientific andTechnical Information,2004,23(6):655-660.

Authors:	Li Xiangyang and Zhang Yafei

Abstract:	We present a competing classification method to extract book information from Web.The method uses similarity between information fragments and samples as competing ability.It classifies fragments and filters noise information through competition of fragments for template slots.It needs far less tagged samples than those using rules to extract information.The method is adaptive to data sources having items of various orders and missing items and can be applied to integrate book information from various Web sites.

Keywords:	informatoin extraction competing classification feature extraction data integration
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏