
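Research on Hierarchy Identification of Chinese Terms in the Field of E-government (电子政务领域中文术语层次关系识别研究)
Cite this article: Zhang Wei, Wang Hao, Deng Sanhong, Zhang Baolong. Research on Hierarchy Identification of Chinese Terms in the Field of E-government [J]. Journal of the China Society for Scientific and Technical Information (情报学报), 2021, 40(1): 62-76
Authors: Zhang Wei (张卫), Wang Hao (王昊), Deng Sanhong (邓三鸿), Zhang Baolong (张宝隆)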
Affiliations: School of Information Management, Nanjing University, Nanjing 210023; Jiangsu Key Laboratory of Data Engineering and Knowledge Service (Nanjing University), Nanjing 210023
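Funding: Major Project of the National Social Science Fund of China, "Research on the Disciplinary Construction of Information Science and the Future Development Path of Information Work" (17ZDA291); 2019 Jiangsu Province Postgraduate Research Innovation Program, "Research on Micro-Government Ontology Construction and Services for 'Internet + Government Affairs'" (KYCX19_0884).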
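Abstract (translated from the Chinese): In a data-driven era, the steadily growing body of e-government information resources is increasingly multi-source and heterogeneous. Designing, on the basis of a large-scale corpus, an automated scheme for identifying deep hierarchical relations among Chinese terms in the e-government domain helps compensate, in both content and structure, for the shortcomings of manually constructed domain thesauri, and it also has great practical significance for the open sharing and subsequent application of China's government information resources. This paper therefore identifies the deep associations among terms in an e-government thesaurus from the dual perspectives of content and structure. The content-based hierarchy generated by spectral clustering serves as the preliminary framework, and the structure-based hierarchy generated by formal concept analysis guides its later revision, with the aim of building an e-government term ontology that balances the recall and accuracy of related terms. The results show that the hierarchical structure of the e-government term ontology is reasonable and effective, and the evaluation of the term hierarchy indicates that the ontology has good expansibility and extensibility.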

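Keywords (translated from the Chinese): e-government terms; hierarchical relations; ontology; spectral clustering; formal concept analysis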

Research on Hierarchy Identification of Chinese Terms in the Field of E-government
Zhang Wei, Wang Hao, Deng Sanhong, Zhang Baolong. Research on Hierarchy Identification of Chinese Terms in the Field of E-government [J]. Journal of the China Society for Scientific and Technical Information, 2021, 40(1): 62-76
Authors: Zhang Wei, Wang Hao, Deng Sanhong, Zhang Baolong
Affiliation: School of Information Management, Nanjing University, Nanjing 210023; Jiangsu Key Laboratory of Data Engineering and Knowledge Service, Nanjing University, Nanjing 210023
Abstract: In a data-driven era, the growing body of e-government information resources is increasingly multi-source and heterogeneous. Using a large-scale corpus to design an automatic scheme for identifying the deep hierarchy of Chinese terms in the field of e-government compensates, in both content and structure, for the shortcomings of manually constructed thesauri, and it has great practical significance for the open sharing and subsequent application of government information resources in China. This paper therefore identifies the deep associations between terms in an e-government thesaurus from the dual perspectives of content and structure. The content-based hierarchy generated by spectral clustering serves as the preliminary framework, and the structure-based hierarchy generated by formal concept analysis guides its later revision, with the aim of forming an ontology of e-government terms that balances the recall and accuracy of related terms. The results show that the hierarchy of the e-government term ontology is reasonable and effective, and the evaluation of the term hierarchies indicates that the ontology has good expansibility and extensibility.
Keywords: e-government term; hierarchy; ontology; spectral clustering; formal concept analysis
This article is indexed in VIP, Wanfang Data, and other databases.