首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Soft-constrained inference for Named Entity Recognition
Authors:E Fersini  E Messina  G Felici  D Roth
Institution:1. DISCo, University of Milano – Bicocca, Viale Sarca, 336, 20126 Milano, Italy;2. CNR, Institute for Systems Analysis and Computer Science, Viale Manzoni, 30, 00185 Roma, Italy;3. Department of Computer Science, University of Illinois at Urbana–Champaign, 2700 Prairie Meadow Dr., Champaign, IL 61822, United States
Abstract:Much of the valuable information in supporting decision making processes originates in text-based documents. Although these documents can be effectively searched and ranked by modern search engines, actionable knowledge need to be extracted and transformed in a structured form before being used in a decision process. In this paper we describe how the discovery of semantic information embedded in natural language documents can be viewed as an optimization problem aimed at assigning a sequence of labels (hidden states) to a set of interdependent variables (textual tokens). Dependencies among variables are efficiently modeled through Conditional Random Fields, an indirected graphical model able to represent the distribution of labels given a set of observations. The Markov property of these models prevent them to take into account long-range dependencies among variables, which are indeed relevant in Natural Language Processing. In order to overcome this limitation we propose an inference method based on Integer Programming formulation of the problem, where long distance dependencies are included through non-deterministic soft constraints.
Keywords:Conditional Random Fields  Named Entity Recognition  Rule extraction  Integer linear programming
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号