RMA with quantile normalization mixes biological signals between different sample groups in microarray data analysis

Chang Sik Kim, Seungwoo Hwang, Shu-Dong Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Quantile normalization (QN) is a technique for microarray data processing and is the default normalization method in the Robust Multi-array Average (RMA) procedure, which was primarily designed for analysing gene expression data from Affymetrix arrays. Given the abundance of Affymetrix microarrays and the popularity of the RMA method, it is crucially important that the normalization procedure is applied appropriately. In this study we carried out simulation experiments and also analysed real microarray data to investigate the suitability of RMA when it is applied to dataset with different groups of biological samples. From our experiments, we showed that RMA with QN does not preserve the biological signal included in each group, but rather it would mix the signals between the groups. We also showed that the Median Polish method in the summarization step of RMA has similar mixing effect. RMA is one of the most widely used methods in microarray data processing and has been applied to a vast volume of data in biomedical research. The problematic behaviour of this method suggests that previous studies employing RMA could have been misadvised or adversely affected. Therefore we think it is crucially important that the research community recognizes the issue and starts to address it. The two core elements of the RMA method, quantile normalization and Median Polish, both have the undesirable effects of mixing biological signals between different sample groups, which can be detrimental to drawing valid biological conclusions and to any subsequent analyses. Based on the evidence presented here and that in the literature, we recommend exercising caution when using RMA as a method of processing microarray gene expression data, particularly in situations where there are likely to be unknown subgroups of samples.
Original languageEnglish
Title of host publication2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages139-143
Number of pages5
DOIs
Publication statusPublished - Nov 2014
EventThe IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2014 - Belfast, United Kingdom
Duration: 02 Nov 201405 Nov 2014

Conference

ConferenceThe IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2014
Country/TerritoryUnited Kingdom
CityBelfast
Period02/11/201405/11/2014

Fingerprint

Dive into the research topics of 'RMA with quantile normalization mixes biological signals between different sample groups in microarray data analysis'. Together they form a unique fingerprint.

Cite this