A comparative benchmark of classic DNA motif discovery tools on synthetic data

Stefano Castellana, Tommaso Biagini, Luca Parca, Francesco Petrizzelli, Salvatore Daniele Bianco, Angelo Luigi Vescovi, Massimo Carella, Tommaso Mazza

Research output: Contribution to journal, Article, peer-review

Abstract

Hundreds of human proteins have been found to establish transient interactions with rather degenerate consensus DNA sequences or motifs. Identifying these motifs and the genomic sites where interactions occur represents one of the most challenging research goals in modern molecular biology and bioinformatics. The last twenty years witnessed an explosion of computational tools designed to perform this task, whose performance was last compared fifteen years ago. Here, we survey sixteen of them, benchmark their ability to identify known motifs nested in twenty-nine simulated sequence datasets, and finally report their strengths, weaknesses, and complementarity.

Original language: English (US)
Journal: Briefings in Bioinformatics
Volume: 22
Issue number: 6
State: Published - Nov 5 2021

Keywords

  • benchmark
  • computational biology
  • genomics
  • motif
  • sequence pattern

ASJC Scopus subject areas

  • Information Systems
  • Molecular Biology
