Abstract
Temporality has significantly contributed to various Natural Language Processing and Information Retrieval applications. In this article, we first create a lexical knowledge-base in Hindi by identifying the temporal orientation of word senses based on their definition and then use this resource to detect underlying temporal orientation of the sentences. To create the resource, we propose a semi-supervised learning framework, where each synset of the Hindi WordNet is classified into one of the five categories, namely, past, present, future, neutral, and atemporal. The algorithm initiates learning with a set of seed synsets and then iterates following different expansion strategies, viz. probabilistic expansion based on classifier's confidence and semantic distance based measures. We manifest the usefulness of the resource that we build on an external task, viz. sentence-level temporal classification. The underlying idea is that a temporal knowledge-base can help in classifying the sentences according to their inherent temporal properties. Experiments on two different domains, viz. general and Twitter, show interesting results.
| Original language | English |
|---|---|
| Article number | 19 |
| Number of pages | 22 |
| Journal | ACM Transactions on Asian and Low-Resource Language Information Processing |
| Volume | 18 |
| Issue number | 2 |
| Early online date | 14 Dec 2018 |
| DOIs | |
| Publication status | Published - Jun 2019 |
| Externally published | Yes |
Bibliographical note
Funding Information:Asif Ekbal acknowledges Young Faculty Research Fellowship (YFRF), supported by Visvesvaraya PhD scheme for Electronics and IT, Ministry of Electronics and Information Technology (MeitY), Government of India, being implemented by Digital India Corporation (formerly Media Lab Asia). Authors’ addresses: S. Kamila, A. Ekbal, and P. Bhattacharyya, Department of Computer Science and Engineering, Indian Institute of Technology Patna, Bihar, India, 801106; emails: {sabysachi.pcs16, asif, pb}@iitp.ac.in; M. Hasanuzzaman, School of Computing, Dublin City University, Dublin, Ireland; email: [email protected]. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]. © 2018 Association for Computing Machinery. 2375-4699/2018/12-ART19 $15.00 https://doi.org/10.1145/3277504
Publisher Copyright:
© 2018 Association for Computing Machinery.
Keywords
- Hindi
- Semi-supervised machine learning
- Sentence-level temporality detection
- Temporal sense detection
ASJC Scopus subject areas
- General Computer Science