Joint Author Sentiment Topic Model

Traditional works in sentiment analysis and aspect rating prediction do not take author preferences and writing style into account during rating prediction of reviews. In this work, we introduce Joint Author Sentiment Topic Model (JAST), a generative process of writing a review by an author. Authors have different topic preferences, 'emotional' attachment to topics, writing style based on the distribution of semantic (topic) and syntactic (background) words and their tendency to switch topics. JAST uses Latent Dirichlet Allocation to learn the distribution of author-specific topic preferences and emotional attachment to topics. It uses a Hidden Markov Model to capture short range syntactic and long range semantic dependencies in reviews to capture coherence in author writing style. JAST jointly discovers the topics in a review, author preferences for the topics, topic ratings as well as the overall review rating from the point of view of an author. To the best of our knowledge, this is the first work in Natural Language Processing to bring all these dimensions together to have an author-specific generative model of a review.

Publications

  • Subhabrata Mukherjee, Gaurab Basu and Sachindra Joshi.
    Joint Author Sentiment Topic Model
    Proc. of the 2014 SIAM International Conference on Data Mining (SDM). 2014.
    PDF SLIDES BIB

Downloads

Dataset used in the SDM 2014 paper: