Semi-supervised learning for automatic conceptual property extraction

Colin Kelly, Barry Devereux, Anna Korhonen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

For a given concrete noun concept, humans are usually able to cite properties (e.g., elephant is animal, car has wheels) of that concept; cognitive psychologists have theorised that such properties are fundamental to understanding the abstract mental representation of concepts in the brain. Consequently, the ability to automatically extract such properties would be of enormous benefit to the field of experimental psychology. This paper investigates the use of semi-supervised learning and support vector machines to automatically extract concept-relation-feature triples from two large corpora (Wikipedia and UKWAC) for concrete noun concepts. Previous approaches have relied on manually-generated rules and hand-crafted resources such as WordNet; our method requires neither yet achieves better performance than these prior approaches, measured both by comparison with a property norm-derived gold standard as well as direct human evaluation. Our technique performs particularly well on extracting features relevant to a given concept, and suggests a number of promising areas for future focus.
Original languageEnglish
Title of host publicationProceedings of the 2012 Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2012 at the 2012 Conference of the North American Chapter of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, NAACL-HLT 2012
EditorsDavid Reitter, David Reitter, Roger Levy
Place of PublicationStroudsburg, PA, USA
PublisherAssociation for Computational Linguistics
Pages11-20
Number of pages10
ISBN (Electronic)1937284204, 9781937284206
Publication statusPublished - 2012
Externally publishedYes
Event3rd Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2012 at the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2012 - Montreal, Canada
Duration: 07 Jun 2012 → …

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
Volume2012-June
ISSN (Print)0736-587X

Conference

Conference3rd Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2012 at the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2012
Country/TerritoryCanada
CityMontreal
Period07/06/2012 → …

Bibliographical note

Funding Information:
This research was supported by EPSRC grant EP/F030061/1. We are grateful to McRae and colleagues for making their norms publicly available, and to the anonymous reviewers for their helpful input.

Publisher Copyright:
© 2012 Association for Computational Linguistics

ASJC Scopus subject areas

  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'Semi-supervised learning for automatic conceptual property extraction'. Together they form a unique fingerprint.

Cite this