Semi-supervised learning for automatic conceptual property extraction

Colin Kelly, Barry Devereux, Anna Korhonen

Research output: Chapter in Book/Report/Conference proceedingConference contribution


For a given concrete noun concept, humans are usually able to cite properties (e.g., elephant is animal, car has wheels) of that concept; cognitive psychologists have theorised that such properties are fundamental to understanding the abstract mental representation of concepts in the brain. Consequently, the ability to automatically extract such properties would be of enormous benefit to the field of experimental psychology. This paper investigates the use of semi-supervised learning and support vector machines to automatically extract concept-relation-feature triples from two large corpora (Wikipedia and UKWAC) for concrete noun concepts. Previous approaches have relied on manually-generated rules and hand-crafted resources such as WordNet; our method requires neither yet achieves better performance than these prior approaches, measured both by comparison with a property norm-derived gold standard as well as direct human evaluation. Our technique performs particularly well on extracting features relevant to a given concept, and suggests a number of promising areas for future focus.
Original languageEnglish
Title of host publicationProceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics
Place of PublicationStroudsburg, PA, USA
PublisherAssociation for Computational Linguistics
Number of pages10
Publication statusPublished - 2012

Publication series

NameCMCL '12
PublisherAssociation for Computational Linguistics

Bibliographical note

bibtex: kelly2012

Fingerprint Dive into the research topics of 'Semi-supervised learning for automatic conceptual property extraction'. Together they form a unique fingerprint.

Cite this