Unsupervised Solution Post Identification from Discussion Forums

Deepak Padmanabhan, Karthik Visweswariah

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Citations (Scopus)
226 Downloads (Pure)


Discussion forums have evolved into a dependable source of knowledge to solve common problems. However, only a minority of the posts in discussion forums are solution posts. Identifying solution posts from discussion forums, hence, is an important research problem. In this paper, we present a technique for unsupervised solution post identification leveraging a so far unexplored textual feature, that of lexical correlations between problems and solutions. We use translation models and language models to exploit lexical correlations and solution post character, respectively. Our technique is designed to not rely much on structural features such as post metadata since such features are often not uniformly available across forums. Our clustering-based iterative solution identification approach based on the EM-formulation performs favorably in an empirical evaluation, beating the only unsupervised solution identification technique from literature by a very large margin. We also show that our unsupervised technique is competitive against methods that require supervision, outperforming one such technique comfortably.
Original language: English
Title of host publication: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Publisher: Association for Computational Linguistics
Number of pages: 10
Publication status: Published - 2014
Event: 52nd Annual Meeting of the Association for Computational Linguistics 2014 - Baltimore, Maryland, United States
Duration: 22 Jun 2014 → 27 Jun 2014


Conference: 52nd Annual Meeting of the Association for Computational Linguistics 2014
Country/Territory: United States

