Cascaded one-vs-rest detection network for fine-grained recognition without part annotations

Long Chen, Shengke Wang*, Kin Man Lam, Huiyu Zhou, Muwei Jian, Junyu Dong

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Fine-grained recognition is a challenging task due to small intra-category variances. Most of the top-performing fine-grained recognition methods leverage parts of objects for better performance. Therefore, part annotations which are extremely computationally expensive are required. In this paper, we propose a novel cascaded deep CNN detection framework for fine-grained recognition which is trained to detect a whole object without considering parts. Nevertheless, most of the current top-performing detection networks use N + 1 class (N object categories plus background) softmax loss. The background category with much more training samples dominates the feature learning progress where the features are not suitable for object categorisation with fewer samples. To address this issue, we here introduce two strategies: 1) We leverage a cascaded structure to eliminate the background. 2) We introduce a novel one-vs-rest loss function to capture more minute variances from different subordinate categories. Experiments show that our proposed recognition framework achieves comparable performance against the state-of-the-art, part-free, fine-grained recognition methods on the CUB-200-2011 Bird dataset. Meanwhile, our method outperforms most of the existing part annotation based methods and does not need part annotations at the training stage whilst being free from any annotations at the test stage.

Original languageEnglish
Pages (from-to)4381-4395
Number of pages15
JournalMultimedia Tools and Applications
Volume78
Issue number4
Early online date17 Mar 2018
DOIs
Publication statusPublished - 01 Feb 2019
Externally publishedYes

Bibliographical note

Funding Information:
Acknowledgments 2014DFA10410. H. Zhou is supported by UK EPSRC under Grant EP/N011074/1 and Royal Society-Newton Advanced Fellowship under Grant NA160342.

Publisher Copyright:
© 2018, Springer Science+Business Media, LLC, part of Springer Nature.

Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.

Keywords

  • Detection
  • Fine-grained Recognition
  • One-vs-rest
  • Without part annotations

ASJC Scopus subject areas

  • Software
  • Media Technology
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Cascaded one-vs-rest detection network for fine-grained recognition without part annotations'. Together they form a unique fingerprint.

Cite this