Name variants for improving entity discovery and linking

Albert Weichselbraun, Philip Kuntschik, Adrian Brasoveanu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Identifying all names that refer to a particular set of named entities is a challenging task, as quite often we need to consider many features that include a lot of variation like abbreviations, aliases, hypocorism, multilingualism or partial matches. Each entity type can also have specific rules for
name variances: people names can include titles, country and branch names are sometimes removed from organization names, while locations are often plagued by the issue of nested entities. The lack of a clear strategy for collecting, processing and computing name variants significantly lowers the
recall of tasks such as Named Entity Linking and Knowledge Base Population since name variances are frequently used in all kind of textual content.
This paper proposes several strategies to address these issues. Recall can be improved by combining knowledge repositories and by computing additional variances based on algorithmic approaches. Heuristics and machine learning methods then analyze the generated name variances and mark ambiguous names to increase precision. An extensive evaluation demonstrates the effects
of integrating these methods into a new Named Entity Linking framework and confirms that systematically considering name variances yields significant performance improvements.
Original languageEnglish
Title of host publicationProceedings of LDK 2019 (OASICS, Vol.70)
Pages14:1-14:15
Volume70
ISBN (Electronic)978-3-95977-105-4
Publication statusPublished - 2019
Event2nd Conference on Language, Data and Knowledge (LDK 2019) - , Germany
Duration: 20 May 201923 May 2019

Conference

Conference2nd Conference on Language, Data and Knowledge (LDK 2019)
Country/TerritoryGermany
Period20/05/201923/05/2019

Fingerprint

Dive into the research topics of 'Name variants for improving entity discovery and linking'. Together they form a unique fingerprint.

Cite this