Batch effect removal methods for microarray gene expression data integration: a survey

Vasile Cosmin Lazar, Stijn Meganck, Jonatan Taminau, David Steenhoff, Alain Coletta, Colin Molter, David Y. Weiss-Solis, Robin Duqué, Hugues Bersini, Ann Nowe

Research output: Contribution to journal › Article

151 Citations (Scopus)


Genomic data integration is a key goal to be achieved towards large-scale genomic data analysis. This process is very challenging due to the diverse sources of information resulting from genomics experiments. In this work, we review methods designed to combine genomic data recorded from microarray gene expression (MAGE) experiments. It has been acknowledged that the main source of variation between different MAGE datasets is the so-called 'batch effects'. The methods reviewed here perform data integration by removing (or more precisely attempting to remove) the unwanted variation associated with batch effects. They are presented in a unified framework together with a wide range of evaluation tools, which are mandatory in assessing the efficiency and the quality of the data integration process. We provide a systematic description of the MAGE data integration methodology together with some basic recommendations to help users choose the appropriate tools to integrate MAGE data for large-scale analysis, and to evaluate those tools from different perspectives in order to quantify their efficiency. All genomic data used in this study for illustration purposes were retrieved from InSilicoDB.
Original language: English
Pages (from-to): 469-490
Journal: Briefings in Bioinformatics
Issue number: 4
Publication status: Published - 31 Jul 2012


  • Microarray gene expression data
  • batch effect removal
  • large-scale genomic data analysis
  • combining microarray datasets
  • microarray gene expression data merging
  • data integration
