TY - GEN
T1 - Latent Dirichlet Markov allocation for sentiment analysis
AU - Bagheri, Ayoub
AU - Saraee, Mohamad
AU - de Jong, Franciska M.G.
PY - 2013/7
Y1 - 2013/7
N2 - In recent years probabilistic topic models have gained tremendous attention in data mining and natural language processing research areas. In the field of information retrieval for text mining, a variety of probabilistic topic models have been used to analyse content of documents. A topic model is a generative model for documents, it specifies a probabilistic procedure by which documents can be generated. All topic models share the idea that documents are mixture of topics, where a topic is a probability distribution over words. In this paper we describe Latent Dirichlet Markov Allocation Model (LDMA), a new generative probabilistic topic model, based on Latent Dirichlet Allocation (LDA) and Hidden Markov Model (HMM), which emphasizes on extracting multi-word topics from text data. LDMA is a four-level hierarchical Bayesian model where topics are associated with documents, words are associated with topics and topics in the model can be presented with single- or multi-word terms. To evaluate performance of LDMA, we report results in the field of aspect detection in sentiment analysis, comparing to the basic LDA model.
AB - In recent years probabilistic topic models have gained tremendous attention in data mining and natural language processing research areas. In the field of information retrieval for text mining, a variety of probabilistic topic models have been used to analyse content of documents. A topic model is a generative model for documents, it specifies a probabilistic procedure by which documents can be generated. All topic models share the idea that documents are mixture of topics, where a topic is a probability distribution over words. In this paper we describe Latent Dirichlet Markov Allocation Model (LDMA), a new generative probabilistic topic model, based on Latent Dirichlet Allocation (LDA) and Hidden Markov Model (HMM), which emphasizes on extracting multi-word topics from text data. LDMA is a four-level hierarchical Bayesian model where topics are associated with documents, words are associated with topics and topics in the model can be presented with single- or multi-word terms. To evaluate performance of LDMA, we report results in the field of aspect detection in sentiment analysis, comparing to the basic LDA model.
KW - EWI-23676
KW - METIS-300002
KW - IR-87440
M3 - Conference contribution
SN - 0903440547
SP - 90
EP - 96
BT - The Fifth European Conference on Intelligent Management Systems in Operations (IMSIO 5)
PB - ThinkLab, University of Salford
CY - Salford, UK
T2 - The Fifth European Conference on Intelligent Management Systems in Operations (IMSIO 5) , Salford, UK
Y2 - 1 July 2013
ER -