mapDIA: Preprocessing and statistical analysis of quantitative proteomics data from data independent acquisition mass spectrometry

Guoshou Teo, Sinae Kim, Chih-Chiang Tsou, Ben Collins, Anne-Claude Gingras, Alexey I. Nesvizhskii, Hyungwon Choi

Research output: Contribution to journalArticlepeer-review

105 Citations (Scopus)
287 Downloads (Pure)


Data independent acquisition (DIA) mass spectrometry is an emerging technique that offers more complete detection and quantification of peptides and proteins across multiple samples. DIA allows fragment-level quantification, which can be considered as repeated measurements of the abundance of the corresponding peptides and proteins in the downstream statistical analysis. However, few statistical approaches are available for aggregating these complex fragment-level data into peptide- or protein-level statistical summaries. In this work, we describe a software package, mapDIA, for statistical analysis of differential protein expression using DIA fragment-level intensities. The workflow consists of three major steps: intensity normalization, peptide/fragment selection, and statistical analysis. First, mapDIA offers normalization of fragment-level intensities by total intensity sums as well as a novel alternative normalization by local intensity sums in retention time space. Second, mapDIA removes outlier observations and selects peptides/fragments that preserve the major quantitative patterns across all samples for each protein. Last, using the selected fragments and peptides, mapDIA performs model-based statistical significance analysis of protein-level differential expression between specified groups of samples. Using a comprehensive set of simulation datasets, we show that mapDIA detects differentially expressed proteins with accurate control of the false discovery rates. We also describe the analysis procedure in detail using two recently published DIA datasets generated for 14–3-3β dynamic interaction network and prostate cancer glycoproteome. Availability The software was written in C ++ language and the source code is available for free through SourceForge website article is part of a Special Issue entitled: Computational Proteomics.
Original languageEnglish
Pages (from-to)108-120
Number of pages13
JournalJournal of proteomics
Early online date15 Sept 2015
Publication statusPublished - 03 Nov 2015


  • Data independent acquisition
  • Data preprocessing
  • Normalization
  • Differential expression


Dive into the research topics of 'mapDIA: Preprocessing and statistical analysis of quantitative proteomics data from data independent acquisition mass spectrometry'. Together they form a unique fingerprint.

Cite this