首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Integration of an XML electronic dictionary with linguistic tools for natural language processing
Authors:Octavio Santana Suárez  Francisco J Carreras Riudavets  Zenón Hernández Figueroa  Antonio C González Cabrera
Institution:Department of Informática y Sistemas, Edificio de Informática y Matemáticas, Campus Universitario de Tafira, Universidad de Las Palmas de Gran Canaria, 35017 Las Palmas, Spain
Abstract:This study proposes the codification of lexical information in electronic dictionaries, in accordance with a generic and extendable XML scheme model, and its conjunction with linguistic tools for the processing of natural language. Our approach is different from other similar studies in that we propose XML coding of those items from a dictionary of meanings that are less related to the lexical units. Linguistic information, such as morphology, syllables, phonology, etc., will be included by means of specific linguistic tools. The use of XML as a container for the information allows the use of other XML tools for carrying out searches or for enabling presentation of the information in different resources. This model is particularly important as it combines two parallel paradigms—extendable labelling of documents and computational linguistics—and it is also applicable to other languages. We have included a comparison with the labelling proposal of printed dictionaries carried out by the Text Encoding Initiative (TEI). The proposed design has been validated with a dictionary of more than 145 000 accepted meanings.
Keywords:Encoding  Dictionary  XML  Computational linguistics
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号