Abstract
Abstract This paper presents a novel method for contextualizing and enriching large semantic knowledge bases for opinion mining with a focus on Web intelligence platforms and other high-throughput big data applications. The method is not only applicable to traditional sentiment lexicons, but also to more comprehensive, multi-dimensional affective resources such as SenticNet. It comprises the following steps: (i) identify ambiguous sentiment terms, (ii) provide context information extracted from a domain-specific training corpus, and (iii) ground this contextual information to structured background knowledge sources such as ConceptNet and WordNet. A quantitative evaluation shows a significant improvement when using an enriched version of SenticNet for polarity classification. Crowdsourced gold standard data in conjunction with a qualitative evaluation sheds light on the strengths and weaknesses of the concept grounding, and on the quality of the enrichment process.
Original language | English |
---|---|
Pages (from-to) | 78 - 85 |
Journal | Knowledge-Based Systems |
Volume | 69 |
Early online date | 10 May 2014 |
DOIs | |
Publication status | Published - Oct 2014 |
Keywords
- Concept grounding
- Web intelligence
- Social Web
- Big data
- Knowledge extraction
- Opinion mining
- Sentiment analysis
- Disambiguation
- Contextualization
- Common-sense knowledge