Abstract
Recently, one-stage detectors have achieved competitive accuracy and faster speed compared with traditional two-stage detectors on image data. However, in the field of video object detection (VOD), most existing VOD methods are still based on two-stage detectors. Moreover, directly adapting existing VOD methods to one-stage detectors introduces unaffordable computational costs. In this paper, we first analyse the computational bottlenecks of using one-stage detectors for VOD. Based on the analysis, we present a simple yet efficient framework to address the computational bottlenecks and achieve efficient one-stage VOD by exploiting the temporal consistency in video frames. Specifically, our method consists of a location prior network to filter out background regions and a size prior network to skip unnecessary computations on low-level feature maps for specific frames. We test our method on various modern one-stage detectors and conduct extensive experiments on the ImageNet VID dataset. Excellent experimental results demonstrate the superior effectiveness, efficiency, and compatibility of our method. The code is available at https://github.com/guanxiongsun/EOVOD .
Original language | English |
---|---|
Title of host publication | Proceeding of the 17th European Conference on Computer Vision |
Editors | Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner |
Publisher | Springer Nature Switzerland AG |
Pages | 1-16 |
Volume | XXXV |
ISBN (Electronic) | 9783031198335 |
ISBN (Print) | 9783031198328 |
DOIs | |
Publication status | Published - 04 Nov 2022 |
Event | European Conference on Computer Vision - Israel, Tel-Aviv, Israel Duration: 23 Oct 2022 → 27 Oct 2022 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 13695 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | European Conference on Computer Vision |
---|---|
Abbreviated title | ECCV 2022 |
Country/Territory | Israel |
City | Tel-Aviv |
Period | 23/10/2022 → 27/10/2022 |
Fingerprint
Dive into the research topics of 'Efficient one-stage video object detection by exploiting temporal consistency'. Together they form a unique fingerprint.Student theses
-
Towards effective and efficient video object detection
Author: Sun, G., Jul 2023Supervisor: Hua, Y. (Supervisor), Wang, H. (Supervisor) & Robertson, N. (Supervisor)
Student thesis: Doctoral Thesis › Doctor of Philosophy
File