首页 | 本学科首页   官方微博 | 高级检索  
     检索      

“互联网用作语料库”的原理与实践
引用本文:丁政.“互联网用作语料库”的原理与实践[J].洛阳师范学院学报,2008,27(2):93-95.
作者姓名:丁政
作者单位:洛阳师范学院外国语学院,河南洛阳,471022
摘    要:"互联网用作语料库"是一种把互联网上的文本用作语料资源的新兴方法。互联网并非标准意义的语料库,但因包含庞大数量的文本而有具有不可忽视的实用价值。"互联网用作语料库"方法已广泛服务于语言数据挖掘以及语言学假设检验。目前已有数种专门化检索工具问世,同时直接应用通用型搜索引擎搜集语料是应用最广泛的方法。本文介绍"互联网用作语料库"的发展现状、基础理论、基本原理、应用策略与手段。

关 键 词:互联网用作语料库  语料库  检索  搜索引擎  Google

Theory and Application of "Web as a Corpus"
DING Zheng.Theory and Application of "Web as a Corpus"[J].Journal of Luoyang Teachers College,2008,27(2):93-95.
Authors:DING Zheng
Institution:DING Zheng (Foreign Languages College, Luoyang Normal University, Luoyang 471022, China)
Abstract:"Web as a corpus" is a newly developed method of exploiting the vast reservoir of web texts.While the web is not an archetypal corpus,"web as a corpus" method is irrefutably functional,and has found its widespread applications in linguistic data retrieval and linguistic hypothesis testing.A variety of specialized instruments have been developed.And the use of web search engines,such as Google,is also a feasible and widely adopted solution.This paper presents theories and technical issues on "web as a corpus"method and elaborates on its application strategies and solutions.
Keywords:Google
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号