Exploring English lexicon knowledge for Chinese sentiment analysis

He, Yulan; Alani, Harith and Zhou, Deyu (2010). Exploring English lexicon knowledge for Chinese sentiment analysis. In: CIPS-SIGHAN Joint Conference on Chinese Language Processing, 28-29 Aug 2010, Beijing, China.

URL: http://wing.comp.nus.edu.sg/~antho/sighan.html


This paper presents a weakly-supervised method for Chinese sentiment analysis by incorporating lexical prior knowledge obtained from English sentiment lexicons through machine translation. A mechanism is introduced to incorporate the prior information about polarity bearing words obtained from existing sentiment lexicons into latent Dirichlet allocation (LDA) where sentiment labels are considered as topics. Experiments on Chinese product reviews on mobile phones, digital cameras, MP3 players, and monitors demonstrate the feasibility and effectiveness of the proposed approach and show that the weakly supervised LDA model performs as well as supervised classifiers such as Naive Bayes and Support vector Machines with an average of 83% accuracy achieved over a total of 5484 review documents. Moreover, the LDA model is able to extract highly domain-salient polarity words from text.

Viewing alternatives

Download history

Item Actions