TY - JOUR
T1 - HistoClean: open-source software for histological image pre-processing and augmentation to improve development of robust convolutional neural networks
AU - McCombe, Kris
AU - Craig, Stephanie
AU - Viratham Pulsawatdi, Amelie
AU - Quezada-Marín, Javier Ignacio
AU - Hagan, Matthew
AU - Rajendran, Simon
AU - Humphries, Matt
AU - Bingham, Victoria
AU - Salto-Tellez, Manuel
AU - Gault, Richard
AU - James, Jacqueline
PY - 2021
Y1 - 2021
N2 - The growth of digital pathology over the past decade has opened new research pathways and insights in cancer prediction and prognosis. In particular, there has been a surge in deep learning and computer vision techniques to analyse digital images. Common practice in this area is to use image pre-processing and augmentation to prevent bias and overfitting, creating a more robust deep learning model. This generally requires consultation of documentation for multiple coding libraries, as well as trial and error to ensure that the techniques used on the images are appropriate. Herein we introduce HistoClean; a user-friendly, graphical user interface that brings together multiple image processing modules into one easy to use toolkit.HistoClean is an application that aims to help bridge the knowledge gap between pathologists, biomedical scientists and computer scientists by providing transparent image augmentation and pre-processing techniques which can be applied without prior coding knowledge.In this study, we utilise HistoClean to pre-process images for a simple convolutional neural network used to detect stromal maturity, improving the accuracy of the model at a tile, region of interest, and patient level. This study demonstrates how HistoClean can be used to improve a standard deep learning workflow via classical image augmentation and pre-processing techniques, even with a relatively simple convolutional neural network architecture. HistoClean is free and open-source and can be downloaded from the Github repository here: https://github.com/HistoCleanQUB/HistoClean.
AB - The growth of digital pathology over the past decade has opened new research pathways and insights in cancer prediction and prognosis. In particular, there has been a surge in deep learning and computer vision techniques to analyse digital images. Common practice in this area is to use image pre-processing and augmentation to prevent bias and overfitting, creating a more robust deep learning model. This generally requires consultation of documentation for multiple coding libraries, as well as trial and error to ensure that the techniques used on the images are appropriate. Herein we introduce HistoClean; a user-friendly, graphical user interface that brings together multiple image processing modules into one easy to use toolkit.HistoClean is an application that aims to help bridge the knowledge gap between pathologists, biomedical scientists and computer scientists by providing transparent image augmentation and pre-processing techniques which can be applied without prior coding knowledge.In this study, we utilise HistoClean to pre-process images for a simple convolutional neural network used to detect stromal maturity, improving the accuracy of the model at a tile, region of interest, and patient level. This study demonstrates how HistoClean can be used to improve a standard deep learning workflow via classical image augmentation and pre-processing techniques, even with a relatively simple convolutional neural network architecture. HistoClean is free and open-source and can be downloaded from the Github repository here: https://github.com/HistoCleanQUB/HistoClean.
KW - digital image analysis
KW - Image pre-processing
KW - Image Enhancement
KW - Artificial Intelligence
KW - Open-source tool
U2 - 10.1016/j.csbj.2021.08.033
DO - 10.1016/j.csbj.2021.08.033
M3 - Article
SN - 2001-0370
VL - 19
SP - 4840
EP - 4853
JO - Computational and Structural Biotechnology Journal
JF - Computational and Structural Biotechnology Journal
ER -