Hashing-based undersampling for large scale histopathology image classification

  • Xing Tian
  • , Lin Qiu
  • , Qihua Li
  • , Wing W.Y. Ng
  • , Jianjun Zhang
  • , Sam Kwong
  • , Hui Wang
  • , Xinran Dong
  • , Baoyi Liu
  • , Yijun Hu
  • , Honghua Yu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The early diagnosis of cancer based on histopathology images plays an important role in medical science. Existing techniques generally partition the original histopathology image into small pieces for further classification. However, due to the fact that the number of benign (majority) samples is much larger than that of malignant (minority) samples, the classification is significantly imbalanced which adversely affects classification performance. Undersampling is commonly used to address the class-imbalance problem. However, existing methods are typically time consuming so they are not suitable to handle large-scale and high-dimensional data. In this paper we propose a fast and scalable undersampling method, hashing-based undersampling (HBU), to address class imbalance in large-scale medical image classification. Benign images are hashed and then placed into different buckets according to their locations in the input space. Undersampling is achieved by proportionally selecting benign images from the hash buckets. The HBU method is experimentally evaluated on two real histopathology image datasets, CAMELYON16 and ACDC@LUNGHP, by comparison with existing methods. Experimental results show that the HBU method outperforms six state-of-The-Art methods and is scalable and fast.

Original languageEnglish
Title of host publicationProceedings of 2022 IEEE 21st International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)
EditorsYingxu Wang, Konstantin N. Plataniotis, Bernard Widrow, Witold Pedrycz, Witold Kinsner, Petros Spachos, Sam Kwong
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages221-228
Number of pages8
ISBN (Electronic)9781665490849
DOIs
Publication statusPublished - 21 Apr 2023
Event21st IEEE International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2022 - Toronto, Canada
Duration: 08 Dec 202210 Dec 2022

Publication series

NameProceedings of IEEE International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC

Conference

Conference21st IEEE International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2022
Country/TerritoryCanada
CityToronto
Period08/12/202210/12/2022

Bibliographical note

Funding Information:
This work was supported in part by the National Natural Science Foundation of China under Grants 62202175, 61876066, 61772344, and 61672443, the Science and Technology Planning Project of Guangzhou (SL2023A04J01464), the 67th Chinese Postdoctoral Science Foundation (2020M672631), the Key-Area Research and Development of Guangdong Province under Grant 2020B010166002, EU Horizon 2020 Programme (700381, ASGARD), the Hong Kong RGC General Research Funds under Grant 9042489 (CityU 11206317), Grant 9042816 (CityU 11209819) and Grant 9042322 (CityU 11200116), and the Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA). Wing W. Y. Ng, Yijun Hu and Honghua Yu are corresponding authors of this work.

Publisher Copyright:
© 2022 IEEE.

Keywords

  • Cancer diagnosis
  • Class-imbalance
  • Histopathology image
  • Undersampling

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems
  • Signal Processing
  • Decision Sciences (miscellaneous)
  • Information Systems and Management
  • Cognitive Neuroscience

Fingerprint

Dive into the research topics of 'Hashing-based undersampling for large scale histopathology image classification'. Together they form a unique fingerprint.

Cite this