Computational and Physical genomics Lab

Fig. 1. A sketch of DNA packaging in a eukaryotic nucleus. If we zoom in the nucleus, we see that DNA is wrapped around eight histones and resembles beads on a string structure. This bead is called nucleosome (~10nm in size). Two types of chromatin fibers can broadly be classified: regular vs irregular fibers. So far regular fibers can only be seen in vitro whereas irregular fibers have yet to be observed in vivo. If we zoom-in the nucleosome, we can see that 147 base-pair (bp) of DNA is wrapped around the histone proteins that looks like a bead shown here. In the crystal structure we can see how DNA is nicely wrapped around the histone octamer.

Our genome consists of 46 DNA polymers and when fully stretched, they can be as long as ~2 meters in total. However, when compacted to ~10𝜇 they form structures called chromosomes required for faithful division of cells. Such a high-density compaction state of chromatin is a well-defined state however the folding pathway to final state is still not well understood. In a relaxed state, chromatin take beads-on-string conformation—the bead is called nucleosome. Almost all nuclear activities are dependent on nucleosome dynamics and the relaxed state of the genome. Nucleosomes are formed by wrapping DNA around histone octamer. Researchers have been trying to explain how nucleosomes organize the genome. The DNA sequence and non-histone factors (NHFs) such as remodeling factors (RFs) and transcription factors (TFs) are some of the key molecular players that can directly interact and regulate genome organization and dynamics (fig. 1). Understanding these molecular components and the mechanisms of genome organization will not only solve questions that concern cellular aging, heterogeneity, differentiation, and evolution but also open new technologies in the field of personal medicine. Here we want to propose some key objectives of our research to understand the physical and sequence origin of nucleosome positioning and chromatin folding.


Fig. 2. An MNase-seq based nucleosome occupancy in budding yeast. NFRs are located at the 5’ and 3’ ends of the genes. Nucleosomes are highly positioned near the 5’ end.

Physical and genomic determinants of nucleosome positioning

MNase-seq, ODM-seq, long read-seq, and other experiments suggest that nucleosome positioning in the genome is not random, but it is well organized. For instance, in the nearby strongly positioned nucleosome arrays, regulatory elements viz., nucleosome-free regions (NFRs) or nucleosome-depleted regions (NDRs), and nucleosome-inhibitory energy barriers (NIEBs) can be found [1–3] (Fig. 2). Interestingly, these regions correlate well with various vital genome activities viz., transcription, replication, and DNA-repair. Therefore, one of the main goals of my research is to understand what determines nucleosome positioning and how to predict it. To achieve this goal, we use statistical mechanics, computational methods and simulations, and artificial intelligence (AI) or machine learning (ML). The work will explain why some genomic regions are nucleosome-rich and some are not, which will in turn benefit experimental collaborators to design and ask more fundamental questions about how NHFs access DNA.

Fig. 3. Chromatin fibers with non-uniform linker DNA: From low (a) to high (d) variance in linker length. odd- and even-numbered nucleosomes are shown in orange and blue, respectively, and the DNA segment is colored in grey.

Chromatin folding pathway and stability

The evidence of 30nm chromatin fibre mostly come from in vitro studies. However, in vivo studies suggest that chromatin fibers are irregular, and the regular fibers are yet to be observed. The question is what determines the folding pathway of the irregular fibers? It has been shown that nucleosome positioning can affect the folding of 3D chromatin fiber by bringing genomic loci to physical contact referred to as “gene looping” [4–6]. However, in the presence of nucleosome breathing and the binding of NHFs the folding cannot be predicted by only the nucleosome positioning. To understand the folding problem, we will start by constructing a simple nucleosome-linker polymer model and study its motion using Brownian Dynamics (BD) simulations (Fig. 3). Next we will introduce nucleosome breathing and in vivo linker DNA distribution and calculate contact frequency and compare with experimental data.

References

1. G. Drillon, B. Audit, F. Argoul and A. Arneodo, "Evidence of selection for an accessible nucleosomal array in human," BMC Genomics, vol.

17, p. 526, 2016.

2. H. Kharerin, P. J. Bhat, J. F. Marko and R. Padinhateeri, "Role of transcription factor-mediated nucleosome disassembly in PHO5 gene

expression," Sci Rep, vol. 6, p. 20319, 2016.

3. H. Kharerin and L. Bai, "Thermodynamic modeling of genome-wide nucleosome depleted regions in yeast," PLoS Comput Biol, vol. 17, p.

e1008560, 2021.

4. T.-H. S. Hsieh, A. Weiner, B. Lajoie, J. Dekker, N. Friedman and O. J. Rando, "Mapping Nucleosome Resolution Chromosome Folding in

Yeast by Micro-C," Cell, vol. 162, pp. 108-119, 2015.

5. H. Kharerin, P. J. Bhat and R. Padinhateeri, "Role of nucleosome positioning in 3D chromatin organization and loop formation," J Biosci, vol.

45, p. 14, 2020.

6. M. Ohno, T. Ando, D. G. Priest, V. Kumar, Y. Yoshida and Y. Taniguchi, "Sub-nucleosomal Genome Structure Reveals Distinct Nucleosome

Folding Motifs," Cell, vol. 176, pp. 520-534, 2019.