Abstract
For a given concrete noun concept, humans are usually able to cite properties (e.g., elephant is animal, car has wheels) of that concept; cognitive psychologists have theorised that such properties are fundamental to understanding the abstract mental representation of concepts in the brain. Consequently, the ability to automatically extract such properties would be of enormous benefit to the field of experimental psychology. This paper investigates the use of semi-supervised learning and support vector machines to automatically extract concept-relation-feature triples from two large corpora (Wikipedia and UKWAC) for concrete noun concepts. Previous approaches have relied on manually-generated rules and hand-crafted resources such as WordNet; our method requires neither yet achieves better performance than these prior approaches, measured both by comparison with a property norm-derived gold standard as well as direct human evaluation. Our technique performs particularly well on extracting features relevant to a given concept, and suggests a number of promising areas for future focus.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2012 Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2012 at the 2012 Conference of the North American Chapter of the Association for Computational Linguistics |
Subtitle of host publication | Human Language Technologies, NAACL-HLT 2012 |
Editors | David Reitter, David Reitter, Roger Levy |
Place of Publication | Stroudsburg, PA, USA |
Publisher | Association for Computational Linguistics |
Pages | 11-20 |
Number of pages | 10 |
ISBN (Electronic) | 1937284204, 9781937284206 |
Publication status | Published - 2012 |
Externally published | Yes |
Event | 3rd Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2012 at the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2012 - Montreal, Canada Duration: 07 Jun 2012 → … |
Publication series
Name | Proceedings of the Annual Meeting of the Association for Computational Linguistics |
---|---|
Volume | 2012-June |
ISSN (Print) | 0736-587X |
Conference
Conference | 3rd Workshop on Cognitive Modeling and Computational Linguistics, CMCL 2012 at the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2012 |
---|---|
Country/Territory | Canada |
City | Montreal |
Period | 07/06/2012 → … |
Bibliographical note
Funding Information:This research was supported by EPSRC grant EP/F030061/1. We are grateful to McRae and colleagues for making their norms publicly available, and to the anonymous reviewers for their helpful input.
Publisher Copyright:
© 2012 Association for Computational Linguistics
ASJC Scopus subject areas
- Computer Science Applications
- Linguistics and Language
- Language and Linguistics