Building a corpus-derived gazetteer for named entity recognition

Zamin, N. and Oxley, A. (2011) Building a corpus-derived gazetteer for named entity recognition. Communications in Computer and Information Science, 180 CC (PART 2). pp. 73-80. ISSN 18650929

Full text not available from this repository.
Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

Gazetteers, or entity dictionaries, are an important element for Named Entity Recognition. Named Entity Recognition is an essential component of Information Extraction. Gazetteers work as specialized dictionaries to support initial tagging. They provide quick entity identification thus creating richer document representation. However, the compilation of such gazetteers is sometimes mentioned as a stumbling block in Named Entity Recognition. Machine learning, both rule-based and look-up based approaches, are often used to perform this process. In this paper, a gazetteer developed from MUC-3 annotated data for the 'person named' entity type is presented. The process used has a small computational cost. We combine rule-based grammars and a simple filtering technique for automatically inducing the gazetteer. We conclude with experiments to compare the content of the gazetteer with the manually crafted one. © 2011 Springer-Verlag.

Item Type: Article
Additional Information: cited By 5; Conference of 2nd International Conference on Software Engineering and Computer Systems, ICSECS 2011 ; Conference Date: 27 June 2011 Through 29 June 2011; Conference Code:85603
Uncontrolled Keywords: Computational costs; Document Representation; Entity identification; Essential component; Filtering technique; Gazetteer; Information Extraction; Named entity recognition; NAtural language processing; Rule based; Stumbling blocks, Computational linguistics; Learning algorithms; Software engineering, Natural language processing systems
Depositing User: Mr Ahmad Suhairi UTP
Date Deposited: 09 Nov 2023 15:50
Last Modified: 09 Nov 2023 15:50
URI: https://khub.utp.edu.my/scholars/id/eprint/1993

Actions (login required)

View Item
View Item