Abstract
Analysts are often interested in how sentiment towards an organization, a product or a particular technology changes over time. Popular methods that process unstructured textual material to automatically detect sentiment based on tagged dictionaries are not capable of fulfilling this task, even when coupled with part-of speech tagging, a standard component of most text processing toolkits that distinguishes grammatical categories such as article, noun, verb, and adverb. Small corpus size, ambiguity and subtle incremental change of tonal expressions between different versions of a document complicate sentiment detection. Parsing grammatical structures, by contrast, outperforms dictionary-based approaches in terms of reliability, but usually suffers from poor scalability due to its computational complexity. This work provides an over view of different dictionary- and machine-learning-based sentiment detection methods and evaluates them on several Web corpora.After identifying the shortcomings of these methods, the paper proposes an approach based on automatically building Tagged Linguistic Unit (TLU) databases to overcome the restrictions of dictionaries with a limited set of tagged tokens.
Original language | English |
---|---|
Title of host publication | Networked Knowledge - Networked Media |
Place of Publication | Berlin, Germany |
Publisher | Springer |
Pages | 217-234 |
Number of pages | 18 |
ISBN (Electronic) | 978-3-642-02184-8 |
ISBN (Print) | 978-3-642-02183-1 |
DOIs | |
Publication status | Published - 2009 |
Keywords
- Information Systems and Communication Service
- Appl.Mathematics/Computational Methods of Engineering
- Artificial Intelligence (incl. Robotics)
- Information Systems Applications (incl. Internet)