Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration


Wentland, Wolodja ; Knopp, Johannes ; Silberer, Carina ; Hartung, Matthias



Additional URL: http://www.lrec-conf.org/proceedings/lrec2008/summ...
Document Type: Conference or workshop publication
Year of publication: 2008
Book title: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Date of the conference: 28.-30.05.2008
Author/Publisher of the book
(only the first ones mentioned)
:
Calzolar, Nicoletta
Place of publication: Marrakech, Marocco
Publishing house: European Language Resources Association
ISBN: 2-9517408-4-0
Publication language: English
Institution: School of Business Informatics and Mathematics > Wissensrepräsentation u. Wissensmanagement (Juniorprofessur) (Stuckenschmidt 2005-2008)
Subject: 004 Computer science, internet
Keywords (English): Named Entity recognition, Lexicon, lexical database, Multilinguality
Abstract: In this paper, we present HeiNER, the multilingual Heidelberg Named Entity Resource. HeiNER contains 1,547,586 disambiguated English Named Entities together with translations and transliterations to 15 languages. Our work builds on the approach described in (Bunescu and Pasca, 2006), yet extends it to a multilingual dimension. Translating Named Entities into the various target languages is carried out by exploiting crosslingual information contained in the online encyclopedia Wikipedia. In addition, HeiNER provides linguistic contexts for every NE in all target languages which makes it a valuable resource for multilingual Named Entity Recognition, Disambiguation and Classification. The results of our evaluation against the assessments of human annotators yield a high precision of 0.95 for the NEs we extract from the English Wikipedia. These source language NEs are thus very reliable seeds for our multilingual NE translation method.

Dieser Datensatz wurde nicht während einer Tätigkeit an der Universität Mannheim veröffentlicht, dies ist eine Externe Publikation.




+ Citation Example and Export

Wentland, Wolodja ; Knopp, Johannes ; Silberer, Carina ; Hartung, Matthias Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration. Calzolar, Nicoletta In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08) (2008) Marrakech, Marocco [Conference or workshop publication]


+ Search Authors in

+ Page Views

Hits per month over past year

Detailed information



You have found an error? Please let us know about your desired correction here: E-Mail


Actions (login required)

Show item Show item