首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Research of Chinese intangible cultural heritage knowledge graph construction and attribute value extraction with graph attention network
Institution:1. School of Information Management, Nanjing University, Nanjing 210023, China;2. Jiangsu Key Laboratory of Data Engineering and Knowledge Service, Nanjing 210023, China;1. Business School, Hohai University, Nanjing 211100, China;2. Foreign Language School, Hohai University, Nanjing 211100, China;1. Department of Information Science and Technology, South China Business College, Guangdong University of Foreign Studies, Guangzhou 510545, China;2. Department of Computer and Information Science, University of Macau, Macau 999078, China;3. Department of Information and Communication Engineering, Guangzhou Maritime University, Guangzhou 510725, China;1. School of Economics and Management, Harbin Engineering University, Harbin 150001, China;2. Management School, Harbin University of Commerce, Harbin 150028, China;3. Department of Computer Science and Information Engineering, Asia University, Taichung, 41354, Taiwan;4. Department of Computer Science and Engineering, Kyung Hee University, Republic of Korea
Abstract:The development of digital technology promotes the construction of the Intangible cultural heritage (ICH) database but the data is still unorganized and not linked well, which makes the public hard to master the overall knowledge of the ICH. An ICH knowledge graph (KG) can help the public to understand the ICH and facilitate the protection of the ICH. However, a general framework of ICH KG construction is lacking now. In this study, we take the Chinese ICH (nation-level) as an example and propose a framework to build a Chinese ICH KG combining multiple data sources from Baike and the official website, which can extend the scale of the KG. Besides, the data of ICH grows daily, requiring us to design an efficient model to extract the knowledge from the data to update the KG in time. The built KG is based on the triple 〈entity, attribute, attribute value〉 and we introduce the attribute value extraction (AVE) task. However, the public Chinese ICH annotated AVE corpus is lacking. To solve that, we construct a Chinese ICH AVE corpus based on the Distant Supervision (DS) automatically rather than employing traditional manual annotation. Currently, AVE is usually seen as the sequence tagging task. In this paper, we take the ICH AVE as a node classification task and propose an AVE model BGC, combining the BiLSTM and graph attention network, which can fuse and utilize the word-level and character-level information by means of the ICH lexicon generated from the KG. We conduct extensive experiments and compare the proposed model with other state-of-the-art models. Experimental results show that the proposed model is of superiority.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号