Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Sentiment classification of Chinese online reviews: analysing and improving supervised machine learning

Sentiment classification of Chinese online reviews: analysing and improving supervised machine... With the boost of online reviews, a large quantity of consumers’ opinions on certain products and services are generated and spread over the internet, thus techniques of sentiment classification for online reviews rise in response to the requirement of retrieving valuable information. This paper is mainly focused on improving sentiment classification of Chinese online reviews through analysing and improving each step in supervised machine learning. At first, adjectives, adverbs, and verbs are selected as the initial text features. Then, three statistic methods (DF, IG and CHI) are utilised to extract features. At last, a Boolean method is applied to set weight to features and a support vector machine (SVM) is employed as the classifier. Several comparative experiments have been conducted on reviews of two domains: mobile phone (product) reviews and hotel (service) reviews. The experimental results indicate that part of speech (POS), the number of features, evaluation domain, feature extraction algorithm and kernel function of SVM have great influences on sentiment classification, while the number of training corpora has a little impact. In addition, further improvements of DF IG and CHI have been made, which demonstrate the theoretical significance and the practical value of this research. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png International Journal of Web Engineering and Technology Inderscience Publishers

Sentiment classification of Chinese online reviews: analysing and improving supervised machine learning

Loading next page...
 
/lp/inderscience-publishers/sentiment-classification-of-chinese-online-reviews-analysing-and-obESOJ0NOW
Publisher
Inderscience Publishers
Copyright
Copyright © Inderscience Enterprises Ltd. All rights reserved
ISSN
1476-1289
eISSN
1741-9212
DOI
10.1504/IJWET.2012.050968
Publisher site
See Article on Publisher Site

Abstract

With the boost of online reviews, a large quantity of consumers’ opinions on certain products and services are generated and spread over the internet, thus techniques of sentiment classification for online reviews rise in response to the requirement of retrieving valuable information. This paper is mainly focused on improving sentiment classification of Chinese online reviews through analysing and improving each step in supervised machine learning. At first, adjectives, adverbs, and verbs are selected as the initial text features. Then, three statistic methods (DF, IG and CHI) are utilised to extract features. At last, a Boolean method is applied to set weight to features and a support vector machine (SVM) is employed as the classifier. Several comparative experiments have been conducted on reviews of two domains: mobile phone (product) reviews and hotel (service) reviews. The experimental results indicate that part of speech (POS), the number of features, evaluation domain, feature extraction algorithm and kernel function of SVM have great influences on sentiment classification, while the number of training corpora has a little impact. In addition, further improvements of DF IG and CHI have been made, which demonstrate the theoretical significance and the practical value of this research.

Journal

International Journal of Web Engineering and TechnologyInderscience Publishers

Published: Jan 1, 2012

There are no references for this article.