

A machine learning based framework to identify unseen classes in open-world text classification
Institution: 1. School of Information Management, Central China Normal University, Wuhan, 430079, China; 3. Center for Studies of Information Resources, Wuhan University, Wuhan, 430072, China; 4. Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100020, China; 5. School of Information Management, Wuhan University, Wuhan, 430072, China
Abstract: Classical supervised machine learning (ML) follows the closed-world assumption, under which every class encountered at test time was already present during training. This assumption does not hold in a dynamic open-world environment, so automated systems must be able to discover and identify unseen instances. Open-world machine learning (OWML) handles unseen instances and classes through a two-step process: (1) discover and classify unseen instances, and (2) identify the novel classes among the instances discovered in step (1). Most existing OWML research focuses only on step (1), yet step (2) is required to build intelligent systems. The proposed framework comprises three distinct but interconnected modules that together discover and identify unseen classes. An in-depth performance evaluation shows that the proposed framework improves open accuracy by up to 8% compared with state-of-the-art models.
Keywords: Open-world machine learning; Unseen instances; Novel classes; Neighborhood blending; Intelligent applications; Open text classification
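
The two-step process named in the abstract can be illustrated with a minimal sketch. This is not the paper's framework: the abstract does not describe its three modules, so the nearest-centroid classifier, the distance threshold, the greedy grouping, and every name below (`fit_centroids`, `classify_open`, `group_unseen`, `threshold`, `radius`) are assumptions chosen only to show the idea on toy 2-D vectors standing in for text embeddings.

```python
import math
from collections import defaultdict

UNSEEN = "unseen"  # hypothetical label for rejected instances


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def fit_centroids(X, y):
    """Closed-world training: one centroid per known class."""
    groups = defaultdict(list)
    for x, label in zip(X, y):
        groups[label].append(x)
    return {label: centroid(pts) for label, pts in groups.items()}


def classify_open(x, centroids, threshold):
    """Step 1 (assumed rule): return the nearest known class, or UNSEEN
    when even the nearest centroid is farther away than `threshold`."""
    label, dist = min(
        ((lab, math.dist(x, c)) for lab, c in centroids.items()),
        key=lambda pair: pair[1],
    )
    return label if dist <= threshold else UNSEEN


def group_unseen(points, radius):
    """Step 2 (assumed rule): greedily group rejected instances; each
    resulting group is treated as one candidate novel class."""
    clusters = []
    for p in points:
        for members in clusters:
            if math.dist(p, centroid(members)) <= radius:
                members.append(p)
                break
        else:
            clusters.append([p])  # no nearby group: start a new one
    return clusters


if __name__ == "__main__":
    # Toy training data with two known classes.
    X = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (10.0, 0.0), (11.0, 0.0), (10.0, 1.0)]
    y = ["sports", "sports", "sports", "politics", "politics", "politics"]
    centroids = fit_centroids(X, y)

    stream = [(0.5, 0.5), (10.5, 0.2), (0.0, 10.0), (0.5, 10.5), (10.0, 10.0), (10.5, 9.5)]
    labels = [classify_open(x, centroids, threshold=3.0) for x in stream]
    rejected = [x for x, lab in zip(stream, labels) if lab == UNSEEN]
    novel = group_unseen(rejected, radius=2.0)
    print(labels)
    print(len(novel), "candidate novel classes")
```

In this sketch the threshold and radius are fixed by hand; a real system would have to tune or learn them, and the greedy grouping depends on the order in which instances arrive.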
This article is indexed in ScienceDirect and other databases.