TY - GEN
T1 - An unsupervised aspect detection model for sentiment analysis of reviews
AU - Bagheri, Ayoub
AU - Saraee, M.
AU - de Jong, Franciska M.G.
N1 - 10.1007/978-3-642-38824-8_12
PY - 2013/6
Y1 - 2013/6
N2 - With the rapid growth of user-generated content on the internet, sentiment analysis of online reviews has become a hot research topic recently, but due to variety and wide range of products and services, the supervised and domain-specific models are often not practical. As the number of reviews expands, it is essential to develop an efficient sentiment analysis model that is capable of extracting product aspects and determining the sentiments for aspects. In this paper, we propose an unsupervised model for detecting aspects in reviews. In this model, first a generalized method is proposed to learn multi-word aspects. Second, a set of heuristic rules is employed to take into account the influence of an opinion word on detecting the aspect. Third a new metric based on mutual information and aspect frequency is proposed to score aspects with a new bootstrapping iterative algorithm. The presented bootstrapping algorithm works with an unsupervised seed set. Finally two pruning methods based on the relations between aspects in reviews are presented to remove incorrect aspects. The proposed model does not require labeled training data and can be applicable to other languages or domains. We demonstrate the effectiveness of our model on a collection of product reviews dataset, where it outperforms other techniques.
AB - With the rapid growth of user-generated content on the internet, sentiment analysis of online reviews has become a hot research topic recently, but due to variety and wide range of products and services, the supervised and domain-specific models are often not practical. As the number of reviews expands, it is essential to develop an efficient sentiment analysis model that is capable of extracting product aspects and determining the sentiments for aspects. In this paper, we propose an unsupervised model for detecting aspects in reviews. In this model, first a generalized method is proposed to learn multi-word aspects. Second, a set of heuristic rules is employed to take into account the influence of an opinion word on detecting the aspect. Third a new metric based on mutual information and aspect frequency is proposed to score aspects with a new bootstrapping iterative algorithm. The presented bootstrapping algorithm works with an unsupervised seed set. Finally two pruning methods based on the relations between aspects in reviews are presented to remove incorrect aspects. The proposed model does not require labeled training data and can be applicable to other languages or domains. We demonstrate the effectiveness of our model on a collection of product reviews dataset, where it outperforms other techniques.
KW - HMI-SLT: Speech and Language Technology
KW - METIS-297709
KW - IR-86480
KW - EWI-23462
U2 - 10.1007/978-3-642-38824-8_12
DO - 10.1007/978-3-642-38824-8_12
M3 - Conference contribution
SN - 978-3-642-38823-1
T3 - Lecture Notes in Computer Science
SP - 140
EP - 151
BT - Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems, NLDB 2013
PB - Springer
CY - London
T2 - 18th International Conference on Applications of Natural Language to Information Systems, NLDB 2013
Y2 - 19 June 2013 through 21 June 2013
ER -