NoiseBox: toward more efficient and effective learning with noisy labels

Chen Feng, Georgios Tzimiropoulos, Ioannis Patras

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)
3 Downloads (Pure)

Abstract

Despite the large progress in supervised learning with neural networks, there are significant challenges in obtaining high-quality, large-scale and accurately labelled datasets. In such contexts, how to learn in the presence of noisy labels has received more and more attention. Addressing this relatively intricate problem to attain competitive results predominantly involves designing mechanisms that select samples that are expected to have reliable annotations. However, these methods typically involve multiple off-the-shelf techniques, resulting in intricate structures. Furthermore, they frequently make implicit or explicit assumptions about the noise modes/ratios within the dataset. Such assumptions can compromise model robustness and limit its performance under varying noise conditions. Unlike these methods, in this work, we propose an efficient and effective framework with minimal hyperparameters that achieves SOTA results in various benchmarks. Specifically, we design an efficient and concise training framework consisting of a subset expansion module responsible for exploring non-selected samples and a model training module to further reduce the impact of noise, called NoiseBox. Moreover, diverging from common sample selection methods based on the “small loss” mechanism, we introduce a novel sample selection method based on the neighbouring relationships and label consistency in the feature space. Without bells and whistles, such as model co-training, self-supervised pre-training and semi-supervised learning, and with robustness concerning the settings of its few hyper-parameters, our method significantly surpasses previous methods on both CIFAR10/CIFAR100 with synthetic noise and real-world noisy datasets such as Red Mini-ImageNet, WebVision, Clothing1M and ANIMAL-10N.
Original languageEnglish
Pages (from-to)11914 - 11928
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume34
Issue number11
Early online date11 Jul 2024
DOIs
Publication statusPublished - Nov 2024
Externally publishedYes

Fingerprint

Dive into the research topics of 'NoiseBox: toward more efficient and effective learning with noisy labels'. Together they form a unique fingerprint.

Cite this