A Systematic Ensemble Approach to Thermodynamic Modeling of Gene Expression from Sequence Data

Md Abul Hassan Samee, Bomyi Lim, Núria Samper, Hang Lu, Christine A. Rushlow, Gerardo Jiménez, Stanislav Y. Shvartsman, Saurabh Sinha

Research output: Contribution to journal › Article › peer-review

31 Scopus citations


Summary: To understand the relationship between an enhancer DNA sequence and quantitative gene expression, thermodynamics-driven mathematical models of transcription are often employed. These "sequence-to-expression" models can describe an incomplete or even incorrect set of regulatory relationships if the parameter space is not searched systematically. Here, we focus on an enhancer of the Drosophila gene ind and demonstrate how a systematic search of parameter space can reveal a more comprehensive picture of a gene's regulatory mechanisms, resolve outstanding ambiguities, and suggest testable hypotheses. We describe an approach that generates an ensemble of ind models; all of these models are technically acceptable solutions to the sequence-to-expression problem in light of wild-type data, and some represent mechanistically distinct hypotheses about the regulation of ind. This ensemble can be restricted to biologically plausible models using requirements gleaned from in vivo perturbation experiments. Biologically plausible models make unique predictions about how specific ind enhancer sequences affect ind expression; we validate these predictions in vivo through site mutagenesis in transgenic Drosophila embryos.

Original language: English (US)
Pages (from-to): 396-407
Number of pages: 12
Journal: Cell Systems
Issue number: 6
State: Published - Dec 23 2015

All Science Journal Classification (ASJC) codes

  • Pathology and Forensic Medicine
  • Cell Biology
  • Histology

