EMDomics: A robust and powerful method for the identification of genes differentially expressed between heterogeneous classes

Sheida Nabavi, Daniel Schmolze, Mayinuer Maitituoheti, Sadhika Malladi, Andrew H. Beck

Research output: Contribution to journalArticlepeer-review

41 Scopus citations

Abstract

Motivation: A major goal of biomedical research is to identify molecular features associated with a biological or clinical class of interest. Differential expression analysis has long been used for this purpose; however, conventional methods perform poorly when applied to data with high within class heterogeneity. Results: To address this challenge, we developed EMDomics, a new method that uses the Earth mover's distance to measure the overall difference between the distributions of a gene's expression in two classes of samples and uses permutations to obtain q-values for each gene. We applied EMDomics to the challenging problem of identifying genes associated with drug resistance in ovarian cancer. We also used simulated data to evaluate the performance of EMDomics, in terms of sensitivity and specificity for identifying differentially expressed gene in classes with high within class heterogeneity. In both the simulated and real biological data, EMDomics outperformed competing approaches for the identification of differentially expressed genes, and EMDomics was significantly more powerful than conventional methods for the identification of drug resistance-associated gene sets. EMDomics represents a new approach for the identification of genes differentially expressed between heterogeneous classes and has utility in a wide range of complex biomedical conditions in which sample classes show within class heterogeneity. Availability and implementation: The R package is available at http://www.bioconductor.org/packages/release/bioc/html/EMDomics.html Supplementary information: supplementary data are available at Bioinformatics online.

Original languageEnglish (US)
Pages (from-to)533-541
Number of pages9
JournalBioinformatics
Volume32
Issue number4
DOIs
StatePublished - Feb 15 2016

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint

Dive into the research topics of 'EMDomics: A robust and powerful method for the identification of genes differentially expressed between heterogeneous classes'. Together they form a unique fingerprint.

Cite this