Abstract
The Latent Dirichlet Allocation model is an unsupervised generative model that is widely used for topic modelling in text. We propose to add supervision to the model in the form of domain knowledge to direct the focus of topics to more relevant aspects than the topics produced by standard LDA. Experimental results demonstrate the effectiveness of our method. We also propose a novel Twofold-LDA model to improve the current output of LDA in order to visualize results in graphical form, which can ultimately be used by potential customers. Experiments show the benefit of this new output, with the ability to produce topics focused on our desired aspects in a user friendly chart.
Original language | English |
---|---|
Title of host publication | Unknown Host Publication |
Place of Publication | United States |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 253-256 |
Number of pages | 4 |
Volume | 1 |
ISBN (Print) | 978-1-4577-1373-6 |
DOIs | |
Publication status | Published - 2011 |