Synthetic biology is a growing field that develops novel biological systems to solve pressing global challenges. One goal of synthetic biology is to engineer microbes into sustainable biomolecule factories: by integrating foreign genes that encode a metabolic pathway to generate a molecule of interest, an engineered microorganism can be used to biologically convert renewable feedstocks into valuable materials. While there have been successful efforts using sugar-consuming model organisms, such as Saccharomyces cerevisiae or Escherichia coli, there is a strong case for developing an industrial production platform in a bacterium such as Methylomicrobium buryatense 5GB1, which instead consumes methane, a potent greenhouse gas second only to CO2 in its contributions to anthropogenic climate change.
However, incorporating new engineering that optimally functions within a host microbe is extremely difficult without knowledge of the sequences controlling gene expression - in other words, its "genetic grammar." To better understand the DNA sequence motifs and regulatory patterns that govern M. buryatense gene expression, we have curated an ensemble of RNA-seq datasets collected across diverse growth conditions. We are mining these data with various computational techniques in order to find motifs and expression patterns relevant for developing further metabolic engineering tools in this organism.