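Design and Implementation of an Integration Scheme for Heterogeneous Patent Data Sources (异构专利数据源集成方案设计与实现)
Cite this article: 翟东升, 禾文汇. 异构专利数据源集成方案设计与实现[J]. 现代图书情报技术, 2010, 26(9): 67-73
Authors: 翟东升 (Zhai Dongsheng), 禾文汇 (He Wenhui)
Affiliation: School of Economics and Management, Beijing University of Technology, Beijing 100124
Funding: This work is one of the research outcomes of the Beijing Natural Science Foundation project "Research on an Information Service Platform for an Intellectual Property Early-Warning Mechanism" (Grant No. 9092002).
Abstract: To address problems with the data currently used for patent analysis, namely reliance on a single source, insufficient preprocessing, and shallow mining potential, this paper designs and implements an integration scheme for heterogeneous patent data sources. Data are first fetched from the patent databases of seven countries and two organizations into a local patent database. With the local database as the base data source, ETL (extract, transform, load) operations carried out with the SSIS tool produce standardized, integrated, high-quality data. These data are then loaded into a patent data warehouse built in advance around KPI (key performance indicator) analysis, providing effective data support for multidimensional patent analysis and data mining.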

Received: 2010-06-28
Revised: 2010-08-12

Design and Implementation of Data Integration over Heterogeneous Patent Sources
Zhai Dongsheng, He Wenhui. Design and Implementation of Data Integration over Heterogeneous Patent Sources[J]. New Technology of Library and Information Service, 2010, 26(9): 67-73
Authors: Zhai Dongsheng, He Wenhui
Affiliation: School of Economics and Management, Beijing University of Technology, Beijing 100124, China
Abstract: To address problems with the data used in patent analysis, such as reliance on a single data source, rough preprocessing, and shallow data mining, this paper designs and implements a data integration scheme over heterogeneous patent sources. Specifically, data acquired from heterogeneous sources covering seven countries and two organizations are stored in a local patent database, which serves as the base data source. After data cleaning and transformation with the SSIS tool, the data from the local database are loaded into a data warehouse built around key performance indicators, providing data support for multidimensional patent analysis and data mining.
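
The extract, transform, load flow described in the abstract can be illustrated with a minimal sketch. The paper itself uses SSIS; the version below swaps in Python with the standard-library `sqlite3` module purely for illustration. Every table name, field name, sample record, and date format here is a hypothetical assumption and is not taken from the paper.

```python
import sqlite3
from datetime import datetime

# Hypothetical raw records, as they might sit in a local patent database
# after being fetched from different patent offices (formats are assumed).
RAW_RECORDS = [
    {"pub_no": "US1234567", "office": "us", "applicant": " Acme Corp. ", "filed": "06/28/2009"},
    {"pub_no": "CN101234567A", "office": "CN", "applicant": "ACME CORP.", "filed": "2009.08.12"},
    {"pub_no": "EP1234567", "office": "ep", "applicant": "Beta GmbH", "filed": "2010-01-05"},
]

# Assumed set of date formats used by the different sources.
DATE_FORMATS = ("%Y-%m-%d", "%Y.%m.%d", "%m/%d/%Y")


def normalize_date(text):
    """Try each assumed source format and return an ISO 8601 date string."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date format: {text!r}")


def transform(record):
    """Clean one raw record into a uniform shape."""
    return {
        "pub_no": record["pub_no"].strip(),
        "office": record["office"].strip().upper(),
        # Collapse whitespace and upper-case so name variants match.
        "applicant": " ".join(record["applicant"].split()).upper(),
        "filed": normalize_date(record["filed"].strip()),
    }


def load(conn, rows):
    """Load cleaned rows into a tiny star schema: one dimension, one fact table."""
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS dim_applicant (
            applicant_id INTEGER PRIMARY KEY,
            name TEXT UNIQUE
        );
        CREATE TABLE IF NOT EXISTS fact_patent (
            pub_no TEXT PRIMARY KEY,
            office TEXT,
            filed TEXT,
            applicant_id INTEGER REFERENCES dim_applicant(applicant_id)
        );
    """)
    for row in rows:
        conn.execute(
            "INSERT OR IGNORE INTO dim_applicant (name) VALUES (?)",
            (row["applicant"],),
        )
        applicant_id = conn.execute(
            "SELECT applicant_id FROM dim_applicant WHERE name = ?",
            (row["applicant"],),
        ).fetchone()[0]
        conn.execute(
            "INSERT OR REPLACE INTO fact_patent VALUES (?, ?, ?, ?)",
            (row["pub_no"], row["office"], row["filed"], applicant_id),
        )
    conn.commit()


def run_etl(conn, raw_records):
    """Extract (here: an in-memory list), transform, then load."""
    load(conn, [transform(r) for r in raw_records])


def patents_per_applicant(conn):
    """Example KPI-style query: number of patents per applicant."""
    return conn.execute("""
        SELECT a.name, COUNT(*)
        FROM fact_patent AS f
        JOIN dim_applicant AS a ON a.applicant_id = f.applicant_id
        GROUP BY a.name
        ORDER BY a.name
    """).fetchall()
```

Calling `run_etl(sqlite3.connect(":memory:"), RAW_RECORDS)` and then `patents_per_applicant` on the same connection groups the two differently formatted Acme records under one applicant. Reconciling such variants before loading is the kind of cleaning the transform stage is meant to perform.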