请输入您要查询的百科知识:

 

词条 Named entity
释义

  1. References

In information extraction, a named entity is a real-world object, such as persons, locations, organizations, products, etc., that can be denoted with a proper name. It can be abstract or have a physical existence. Examples of named entities include Barack Obama, New York City, Volkswagen Golf, or anything else that can be named. Named entities can simply be viewed as entity instances (e.g., New York City is an instance of a city).

From a historical perspective, the term Named Entity was coined during the MUC-6 evaluation campaign[1] and contained ENAMEX (entity name expressions e.g. persons, locations and organizations) and NUMEX (numerical expression).

A more formal definition can be derived from the rigid designator by Saul Kripke. In the expression "Named Entity", the word "Named" aims to restrict the possible set of entities to only those for which one or many rigid designators stands for the referent.[2] A designator is rigid when it designates the same thing in every possible world. On the contrary, flaccid designators may designate different things in different possible worlds.

As an example, consider the sentence, "Obama is the president of the United States". Both "Obama" and the "United States" are named entities since they refer to specific objects (Barack Obama and United States). However, "president" is not a named entity since it can be used to refer to many different objects in different worlds (in different presidential periods referring to different persons, or even in different countries or organizations referring to different people). Rigid designators usually include proper names as well as certain natural terms like biological species and substances.

There is also a general agreement in the Named Entity Recognition community to consider as named entities temporal and numerical expressions such as amounts of money and other types of units, which may violate the rigid designator perspective.

The task of recognizing named entities in text is Named Entity Recognition while the task of determining the identity of the named entities mentioned in text is called Named Entity Disambiguation. Both tasks require dedicated algorithms and resources to be addressed.[3]

References

1. ^{{cite conference |last1=Grishman |first1=Ralph |first2=Beth |last2=Sundheim |title=Design of the MUC-6 evaluation |conference=TIPSTER '96 Proceedings |year=1996|url=https://aclweb.org/anthology/M/M95/M95-1001.pdf}}
2. ^{{cite conference |last1=Nadeau |first1=David |first2=Satoshi |last2=Sekine |title=A survey of named entity recognition and classification |conference=Lingvisticae Investigationes |year=2007|url=http://nlp.cs.nyu.edu/sekine/papers/li07.pdf}}
3. ^{{cite book|first1=Damien|last1=Nouvel|first2=Maud|last2=Ehrmann|first3=Sophie|last3=Rosset|title=Named Entities for Computational Linguistics|editor=Wiley|year=2015|isbn=978-1-84821-838-3}}

2 : Natural language processing|Computational linguistics

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/11 18:35:51