Abstract
Recent research shows the potential of utilizing data collected through Web 2.0 applications to capture changes in a domain's terminology. This paper presents an approach to augment corpus-based ontology learning by considering terms from collaborative tagging systems, social networking platforms, and micro-blogging services. The proposed framework collects information on the domain's terminology from domain documents and a seed ontology in a triple store. Data from social sources such as Delicious, Flickr, Technorati and Twitter provide an outside view of the domain and help incorporate external knowledge into the ontology learning process. The neural network technique of spreading activation is used to identify relevant new concepts, and to determine their positions in the extended ontology. Evaluating the method with two measures (PMI and expert judgements) demonstrates the significant benefits of social evidence sources for ontology learning.
Original language | Undefined/Unknown |
---|---|
Title of host publication | 23rd International Workshop on Database and Expert Systems Applications |
Publisher | DEXA 2010 |
Pages | 193-197 |
Number of pages | 5 |
ISBN (Electronic) | 978-0-7695-4174-7 |
DOIs | |
Publication status | Published - 2010 |
Event | 23rd International Workshop on Database and Expert Systems Applications (2010) - Spain, Bilbao, Spain Duration: 30 Aug 2010 → 3 Sept 2010 |
Conference
Conference | 23rd International Workshop on Database and Expert Systems Applications (2010) |
---|---|
Country/Territory | Spain |
City | Bilbao |
Period | 30/08/2010 → 03/09/2010 |
Keywords
- Evidence Source Integration
- Ontology Learning
- Spreading Activation
- Social Evidence Source
- Web 2.0