Research

Molecular Systems Virology

The molecular mechanisms viruses use to infect cells are a window into normal cell biology and can help identify new therapeutic targets to treat viral infections.

Viruses have been critical in defining many basic cell biology and molecular biology principles in the 20th century, including helping define the very nature of DNA as the genetic material, elucidating basic mechanisms of gene-regulation, and identification of cancer promoting ("oncogenes") and inhibitory ("tumor suppressors") cellular proteins.

Our lab uses cutting-edge molecular and computational techniques to study how viruses manipulate human cells, in order to further our understanding of fundamental mechanisms viruses use to replicate, as well as identify new therapeutic targets to inhibit infection.


Systems Analysis of Human Cytomegalovirus (HCMV) Infection. From doi: 10.1073/pnas.1710799114

Pathways Analysis of Viral Infection

All viruses rely on cellular signaling pathways and metabolic programs to make copies of themselves. Classic examples are ribosomal translation, ATP generation, and nucleotide biosynthesis. All cells translate proteins and generate ATP and most can either import or sythesize nucleotides. Yet, viruses cannot infect all cells. Each virus has a defined "tropism", or capacity to infect different cell types.

Numerous examples of non-permissive and semi-permissive infections have been observed in experimental infection systems, illustrating that some cell types posses intrinsic barriers to infection. In many cases, non-permissive cell types simply lack surface entry factors required to internalize virus particles, but in others virus particles successfully enter cells, but the infectious cycle stalls. The existence of these "semi-permissive" infections implies that cellular pathways besides translation and ATP generation are required to support viral replication; and that limiting cellular functions downstream of entry can be identified.

We are pursuing two general approaches to identify new cellular pathways required for virus replication: (1) computational analysis of gene-expression patterns in virally-infected cells, and (2) restoration of essential "host-factors" in non-permissive and semi-permissive cell types.

Pathways Analysis of HCMV Infection Across Two Cell Types. (C-E ) Gene set (pathway) overlap matrices at 24, 72, and 120 hours post-infection used to create gene set networks in panels F - H. (F-H ) Data reduction and community identification of related pathways during HCMV infection. Each node (circle) represents a pathway altered by HCMV infection. Local communities of related biological function are clustered into communities using self-organizing, force-directed, networks.

Cartoon and Cryo-Electron Microscopy Reconstruction of HCMV Virions. Panels A, C, & D modified from Yu, X. et al. Journal of Structural Biology 174, 451–460 (2011). Panel B modified from http://viralzone.expasy.org.

Human Cytomegalovirus

A major focus of our effort is the opportunistic human herpesvirus, human cytomegalovirus (HCMV). HCMV remains and underappreciated human pathogen and causes serious disease in transplant patients and infants. There is no vaccine for HCMV and the virus rapidly evolves resistance to anti-viral drugs.

We have identified several cell culture models where HCMV enters cells, but fails to produce infectious particles. By studying these states, we intend to let biology teach us how to inhibit HCMV, which we hope will lead to new therapeutic strategies.

We are also developing computational infrastructure to identify cellular pathways required by a variety of pathogenic human viruses. We intend to complement these in silico analyses with experimental infection models either in-house or through collaboration.

Data-mining of public gene-expression data:

Gene-expression profiling is a powerful methodology for determining how cells respond to viral infection. Recent technologies have enabled routine acquisition of global gene-expression patterns (transcriptomes) and sophisticated computational methods have been developed to monitor how sets of genes coordinately respond to different experimental treatments. Databases of thousands of gene-expression experiments, across many biological domains, have accumulated and are accessible to biomedical researchers. This wealth of data can potentially be used to extract useful biological information through cross-study re-analysis, or "meta-analysis."

However, public data is challenging to use for discovery. An apical bottleneck to data reuse, is metadata organization and standardization. Public databases, such as the NCBI's Gene-Expression Omnibus (GEO), require submission of metadata describing each sample's biological parameters (e.g. "cell-type", "treatment", or "replicate"). However, due to the vastly different types of experiments submitted to such databases, rigid metadata vocabularies have not been established. Therefore, it can be difficult to organize many different studies and metadata typically needs to be coerced into a consistent format, on an ad-hoc basis, for re-analysis. This is especially important when analyzing dozens to hundreds of data sets or scripting automated processing routines.

We are developing several tools to aid in acquisition, organization, and standardization of sequencing metadata. These tools enable metadata organization for single to hundreds of sequencing studies, can be deployed by individual users or teams of researchers, and allow preparation of customized metadata databases that can be easily parsed by automated data processing routines. We intend to use these tools to perform large meta-analyses of both public and newly acquired gene-expression data sets.

Cell-state Control of Herpesvirus Infection:

We have identified the epithelial-to-mesenchymal transition (EMT) pathway as a cellular pathway strongly modulated during HCMV infection. EMT is a migratory program induced during development and tissue repair where static epithelial cells acquire a motile, "mesenchymal" phenotype. Mesenchymal cells can revert to an epithelial phenotype by activation of an inverse cellular program, termed "mesenchymal-to-epithelial transition (MET)." Cycles of EMT and MET have been shown to be important during development, wound-healing, and different phases of tumorigenesis.

Simple model of cellular changes induced by EMT and MET. Secreted cytokines, such as TGF-b, induce trans-differentiation of epithelial sheets into scattered mesenchymal (M) cells. Mesenchymal cells can be reverted back to the epithelial (E) state through MET.

HCMV infection is sensitive to the activation state of the EMT and MET pathways during infection. We are trying to understand how the mesenchymal and epithelial cell states influence HCMV infection. By genetically and chemically modulating the EMT and MET programs, and observing the effect on HCMV infection, we hope to define how these cell states influence HCMV replication. We are exploring the possibility that the mesenchymal and epithelial states influence the pattern of HCMV infection - whether HCMV replicates productively, stalls, or enters a quiescent state termed "latency." We are also examining how the mesenchymal and epithelial states influence related human herpesviruses, such as Herpes-simplex 1 (HSV-1) and Kaposi's Sarcoma-associated Virus (KSHV).

Moduation of EMT and MET pathways during HCMV infection. Left, modulation of EMT and MET genes during in vitro infection of ARPE-19 epithelial cells and MRC-5 fibroblasts. Right, western anlaysis of HCMV infected ARPE-19 and MRC-5 cells showing upregulation of epithelial proteins E-cadherin and EpCAM, and down-regulation of the mesenchymal protein Fibronectin.