Increasing NER recall with minimal precision loss

Jasper Kuperus, Maurice van Keulen, Cor J. Veenman

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

3 Citations (Scopus)
1088 Downloads (Pure)

Abstract

Named Entity Recognition (NER) is broadly used as a first step toward the interpretation of text documents. However, for many applications, such as forensic investigation, recall is currently inadequate, leading to loss of potentially important information. Entity class ambiguity cannot be resolved reliably due to the lack of context information or the exploitation thereof. Consequently, entity classification introduces too many errors, leading to severe omissions in answers to forensic queries. We propose a technique based on multiple candidate labels effectively postponing decisions for entity classification to query time. Entity resolution exploits user feedback: a user is only asked for feedback on entities relevant to his/her query. Moreover, giving feedback can be stopped anytime when query results are considered good enough. We propose several interaction strategies that obtain increased recall with little loss in precision.
Original languageUndefined
Title of host publicationProceedings of the European Intelligence and Security Informatics Conference (EISIC 2013)
Place of PublicationUSA
PublisherIEEE
Pages106-111
Number of pages6
ISBN (Print)978-0-7695-5062-6
DOIs
Publication statusPublished - Aug 2013
EventWorkshop on Forensic Text Analytics (FORTAN 2013), Uppsala, Sweden: Proceedings of the Workshop on Forensic Text Analytics (FORTAN 2013) - USA
Duration: 1 Aug 2013 → …

Publication series

Name
PublisherIEEE Computer Society

Conference

ConferenceWorkshop on Forensic Text Analytics (FORTAN 2013), Uppsala, Sweden
CityUSA
Period1/08/13 → …

Keywords

  • EWI-23400
  • METIS-297674
  • IR-86348

Cite this