基于DOM的Web信息自动抽取 Automatic Web Information Extraction Based on DOM期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于DOM的Web信息自动抽取

引用本文：	吴伟,刘友华. 基于DOM的Web信息自动抽取[J]. 现代图书情报技术, 2004, 20(2): 68-71

作者姓名：	吴伟刘友华

作者单位：	南京大学信息管理系,南京,210093

摘要：	提出了Web页面信息的自动抽取思想，并使用WebBrowser和DOM技术实现了Web页面上网页元素查找、表单自动填写、表单自动提交、自动获得查询结果并自动抽取所需信息的技术，从而实现了Web页面信息的自动抽取。文中还给出了这一方法的实现细节和示例代码。
关键词：	Web页面自动信息抽取 DOM WebBrowser
收稿时间：	2003-09-15
修稿时间：	2003-09-15
Automatic Web Information Extraction Based on DOM

Wu Wei Liu Youhua. Automatic Web Information Extraction Based on DOM[J]. New Technology of Library and Information Service, 2004, 20(2): 68-71

Authors:	Wu Wei Liu Youhua

Affiliation:	(Department of Information Management, Nanjing University,Nanjing 210093,China)

Abstract:	More and more Web sites are built on database -driven architecture. The Web pages of these sites are creating dynamically. This paper advances and implements a method of automatic information extraction from the dynamic pages by using WebBrowser and DOM technique. In addition, the paper illustrates the details and code through a prototype.

Keywords:	Dynamic Web Automatic information extraction DOM WebBrowser
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《现代图书情报技术》浏览原始摘要信息
	点击此处可从《现代图书情报技术》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏