Online residual-based key frame sampling with self-coach mechanism and adaptive multi-level feature fusion

Rui Zhang, Yang Hua, Tao Song, Zhengui Xue, Ruhui Ma*, Haibing Guan

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Key frame sampling is a common component in video tasks. Putting more effort into key frames, rather than processing all frames equally, can significantly reduce computational costs and improve processing efficiency. This paper presents ORSampler, an adaptive Online Residual-based key frame Sampler. ORSampler relies on feature residuals to sample key frames and decouples from subsequent video tasks. To facilitate ORSampler, a self-coached mechanism is designed to speed up learning, and an adaptive multi-level feature fusion is proposed to fit the diversity of subsequent video tasks. OR-Sampler has a fast inference speed and can work online. Extensive experiments on two typical video tasks verify the effectiveness and generality of our proposed ORSampler.

Original languageEnglish
Title of host publicationICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728163277
ISBN (Print)9781728163284
DOIs
Publication statusPublished - 05 May 2023
Event48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023 - Rhodes Island, Greece
Duration: 04 Jun 202310 Jun 2023

Publication series

NameIEEE International Conference on Acoustics, Speech and Signal Processing: Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISSN (Print)1520-6149
ISSN (Electronic)2379-190X

Conference

Conference48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Country/TerritoryGreece
CityRhodes Island
Period04/06/202310/06/2023

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • key frame sampling
  • non-uniform sampling
  • reinforcement learning
  • Video signal processing

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Online residual-based key frame sampling with self-coach mechanism and adaptive multi-level feature fusion'. Together they form a unique fingerprint.

Cite this