Detecting sequence dependent transcriptional pauses from RNA and protein number time series

Frank Emmert-Streib, Antti Häkkinen, Andre S Ribeiro

Research output: Contribution to journal › Article › peer-review


Background: Evidence suggests that in prokaryotes sequence-dependent transcriptional pauses affect the dynamics of transcription and translation, as well as of small genetic circuits. So far, a few pause-prone sequences have been identified from in vitro measurements of transcription elongation kinetics.

Results: Using a stochastic model of gene expression at the nucleotide and codon levels with realistic parameter values, we investigate three different but related questions and present statistical methods for their analysis. First, we show that information from in vivo RNA and protein temporal numbers is sufficient to discriminate between models with and without a pause site in their coding sequence. Second, we demonstrate that a large variety of models, with pauses of various durations and locations in the template, can be separated from each other by means of hierarchical clustering and a random forest classifier. Third, we introduce an approximate likelihood function that allows the location of a pause site to be estimated.

Conclusions: This method can aid in detecting unknown pause-prone sequences from temporal measurements of RNA and protein numbers at a genome-wide scale and thus elucidate possible roles that these sequences play in the dynamics of genetic networks and phenotype.
Original language: English
Article number: 152
Number of pages: 13
Journal: BMC Bioinformatics
Publication status: Published - 28 Jun 2012

ASJC Scopus subject areas

  • Computer Science Applications
  • Molecular Biology
  • Biochemistry
