The Open UniversitySkip to content
 

Semantic smoothing for Twitter sentiment analysis

Saif, Hassan; He, Yulan and Alani, Harith (2011). Semantic smoothing for Twitter sentiment analysis. In: 10th International Semantic Web Conference (ISWC 2011), 23-27 Oct 2011, Bonn, Germany.

Full text available as:
[img]
Preview
PDF (Version of Record) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (188kB) | Preview
URL: http://iswc2011.semanticweb.org/fileadmin/iswc/Pap...
Google Scholar: Look up in Google Scholar

Abstract

Twitter has brought much attention recently as a hot research topic in the domain of sentiment analysis. Training sentiment classifier from tweets data often faces the data sparsity problem partly due to the large variety of short forms introduced to tweets because of the 140-character limit. In this work we propose using semantic smoothing to alleviate the data sparseness problem. Our approach extracts semantically hidden concepts from the training documents and then incorporates these concepts as additional features for classifier training. We tested our approach using two different methods. One is shallow semantic smoothing where words are replaced with their corresponding semantic concepts; another is to interpolate the original unigram language model in the Naive Bayes NB classifier with the generative model of words given semantic concepts. Preliminary results show that with shallow semantic smoothing the vocabulary size has been reduced by 20%. Moreover, the interpolation method improves upon shallow semantic smoothing by over 5% in sentiment classification and slightly outperforms NB trained on unigrams only without semantic smoothing.

Item Type: Conference or Workshop Item
Copyright Holders: 2011 The Authors
Project Funding Details:
Funded Project NameProject IDFunding Body
ROBUSTROBUSTEU
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Research Group: Centre for Research in Computing (CRC)
Related URLs:
Item ID: 38502
Depositing User: Harith Alani
Date Deposited: 26 Sep 2013 09:57
Last Modified: 11 May 2019 18:12
URI: http://oro.open.ac.uk/id/eprint/38502
Share this page:

Download history for this item

These details should be considered as only a guide to the number of downloads performed manually. Algorithmic methods have been applied in an attempt to remove automated downloads from the displayed statistics but no guarantee can be made as to the accuracy of the figures.

Actions (login may be required)

Policies | Disclaimer

© The Open University   contact the OU