Enriching semantic knowledge bases for opinion mining in big data applications

Albert Weichselbraun, Stefan Gindl, Arno Scharl

Research output: Contribution to journalArticleResearchpeer-review

Abstract

Abstract This paper presents a novel method for contextualizing and enriching large semantic knowledge bases for opinion mining with a focus on Web intelligence platforms and other high-throughput big data applications. The method is not only applicable to traditional sentiment lexicons, but also to more comprehensive, multi-dimensional affective resources such as SenticNet. It comprises the following steps: (i) identify ambiguous sentiment terms, (ii) provide context information extracted from a domain-specific training corpus, and (iii) ground this contextual information to structured background knowledge sources such as ConceptNet and WordNet. A quantitative evaluation shows a significant improvement when using an enriched version of SenticNet for polarity classification. Crowdsourced gold standard data in conjunction with a qualitative evaluation sheds light on the strengths and weaknesses of the concept grounding, and on the quality of the enrichment process.
Original languageEnglish
Pages (from-to)78 - 85
JournalKnowledge-Based Systems
Volume69
Early online date10 May 2014
DOIs
Publication statusPublished - Oct 2014

Keywords

  • Concept grounding
  • Web intelligence
  • Social Web
  • Big data
  • Knowledge extraction
  • Opinion mining
  • Sentiment analysis
  • Disambiguation
  • Contextualization
  • Common-sense knowledge

Fingerprint

Dive into the research topics of 'Enriching semantic knowledge bases for opinion mining in big data applications'. Together they form a unique fingerprint.

Cite this