Skip to main navigation Skip to search Skip to main content

To Extend or Not to Extend? Context-Specific Corpus Enrichment

  • Felix Kuhr
  • , Tanya Braun
  • , Magnus Bender
  • , R. Möller

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

An agent in pursuit of a task may work with a corpus of documents with linked subjective content descriptions. Faced with a new document, an agent has to decide whether to include that document in its corpus or not. Basing the decision on only words, topics, or entities, has shown to not lead to a balanced performance for varying documents. Therefore, this paper presents an approach for an agent to decide if a new document adds value to its existing corpus by combining texts and content descriptions. Furthermore, an agent can use the approach as a starting point for high quality content descriptions for new documents. A case study shows the effectiveness of our approach given varying types of new documents.
Original languageEnglish
Title of host publicationAI 2019 : Advances in Artificial Intelligence - 32nd Australasian Joint Conference, 2019, Proceedings
EditorsJixue Liu, James Bailey
Number of pages12
Volume11919
Publication date25 Nov 2019
Pages357-368
ISBN (Print)9783030352875
DOIs
Publication statusPublished - 25 Nov 2019
Externally publishedYes
EventAustralasian Joint Conference: Advances in Artificial Intelligence - Adelaide, Australia
Duration: 2 Dec 20195 Dec 2019
Conference number: 32

Conference

ConferenceAustralasian Joint Conference
Number32
Country/TerritoryAustralia
CityAdelaide
Period02/12/201905/12/2019

Keywords

  • Named Entity Recognition
  • Semantics
  • Text mining
  • Subjective content description
  • Recommender Systems
  • Models
  • Entailment
  • Embedding

Citation Styles