Biology Major
Biology Department
The global population is aging, and as a consequence, the number of individuals experiencing age-related symptoms is increasing. Senile cataracts occur when the eye lens becomes cloudy due to the buildup of unfolded, or denatured, proteins over time. Cataracts are the leading cause of blindness worldwide and cause a significant decline in quality of life. They are particularly difficult for individuals who live in societies that have minimal accommodations for the sight-impaired. To better understand the formation and structure of cataracts, and the lack thereof in animals that are resistant to cataract formation, we isolated and sequenced the lens crystallin cDNAs from the red caiman. Lens crystallins are the proteins in the eye lens that impart the transparent optical nature to this tissue, and when these crystallins are disrupted, the result is cataract formation. Caimans, being crocodilian, are an evolutionarily ancient and very successful group of animals, meaning that their genome is well-adapted to life on earth. Being cold-blooded, aquatic, and native to the tropics, caimans are subject to a lot of environmental stress. This makes them an ideal candidate to study regarding life-long lens crystallin structure and stability. Lens crystallins, like many proteins, come in a variety of subtle variations – called isoforms. These isoforms arise due to small changes in the genetic code that cause changes in key amino acids during alternative splicing. These variants all have similar, but slightly different structures when folded into their final forms as proteins. We used a novel technology that uses full-length isoform sequencing (Iso-Seq), a method of long-read cDNA sequencing, to inventory the mRNA transcript-level variations in the caiman lens crystallins that result in their many isoforms. With this data, we hope to pinpoint not only the isoforms that may be particularly stable or resistant to stress, but also any mutations at specific amino acid residues that make them uniquely resistant to senile insult. Current treatment for cataracts is surgical, but with the increasing aging population, cataract surgery may prove to be ineffective at meeting the demand for cataract treatment. Therefore, the hope is for this data to be used in the development of a non-surgical treatment for cataracts, such as a prophylactic treatment similar to fluorination of drinking water, or the addition of folic acid to cereal grains.
Which lens crystallin isoforms exhibit increased structural stability?
What amino acid residues are responsible for increased lens crystallin stability?
Our work centered around isoform analysis to identify those that are particularly stable or resistant to stress, along with the mutations at specific amino acid residues responsible for their characteristic structural integrity. Because lens crystallins are highly conserved, we opted to use a novel technology for our isoform analysis. The PacBio Vega was recently acquired by the Center for Bioinformatics and Functional Genomics (CBFG) here at Miami. We used the PacBio Vega to perform our isoform analysis, which uses full-length isoform sequencing (Iso-Seq), a method of long-read cDNA sequencing, to inventory the mRNA transcript-level variations in the caiman lens crystallins that result in their many isoforms. Previously, this type of analysis would have had to have been done by short-read cDNA sequencing, which essentially chops up and compares pieces of each isoform sequence, then puts them back together. However, the short fragments can be hard to keep track of, especially for highly conserved proteins like lens crystallins, because they have so many domains with the same sequence. Long-read sequencing allowed us to have a more streamlined analysis by removing the step of rebuilding the whole isoform sequence, which also reduces error.
Preliminary results from visual inspection yield 135 alpha crystallin sequences, 594 beta crystallin sequences, & 294 gamma crystallin sequences. Current literature notes 2 types of alpha crystallins, 7 types of beta crystallins, and 5 types of gamma crystallins, so this is many more sequences than originally anticipated. Currently, we are in the process of sorting through these isoform sequences to distinguish legitimate isoforms from long-form artifacts. That said, the high number of sequences alone suggests the presence of novel isoforms for all types of crystallins. This increased diversity of lens crystallin isoforms may be a mechanism for introducing heterogeneity within each cohort of crystallin proteins. It may also be a mechanism for preventing crystalline formation of the crystallin proteins within the eye lens, as the different isoforms would not bind and be oriented in the same manner.
Next steps for this project involve continuing to sort through our sequence data to further narrow down which sequences are legitimate isoforms. Beyond this, we could use mass spectrometry to validate which isoforms are expressed (as protein) in the lens, since not all mRNAs end up being expressed as proteins. We could also use in silico structural models for the crystallins to better determine residue placement, and therefore function, in stability. Finally, we could also correlate isoform expression with epigenetic markers in lens DNA by sequencing the Red Caiman genome and mapping the isoforms back to it.
Regarding clinical applications, this data could aid in the development of a non-surgical treatment for cataracts, such as a prophylactic, like folic acid in breakfast cereal. A prophylactic treatment would be more accessible than the current standard treatment for cataracts, cataract removal surgery. This may be able to better meet the rising demand for cataract treatments that is occurring in tandem with the aging global population.
The following is an image of poster presented at the 2026 Undergraduate Research Forum
I would like to thank the Center for Bioinformatics and Functional Genomics, the Department of Biology, and the College of Arts and Science here at Miami University for their support regarding undergraduate research. I would also like to thank the Center for Career Exploration and Success for their financial support by awarding me a Research/Internship Grant for the Fall '25 semester.
Wistow, G. (2012). The human crystallin gene families. Human Genomics, 6(1), 26. https://doi.org/10.1186/1479-7364-6-26
International Union for Conservation of Nature. (n.d.). Species conservation planning principles and steps. https://www.iucncsg.org/365_docs/attachments/protarea/05_C-78894f16.pdf
Oaks, J. R. (2011). A time-calibrated species tree of Crocodylia reveals a recent radiation of the true crocodiles. Evolution, 65(11), 3285–3297.
Hoffman, D. K., Goldsmith, E. R., Houssaye, A., Maidment, S. C. R., Felice, R. N., & Mannion, P. D. (2025). Evolution of growth strategy in alligators and caimans informed by osteohistology of the late Eocene early-diverging alligatoroid crocodylian Diplocynodon hantoniensis. Journal of Anatomy, 247(1), 165–178. https://doi.org/10.1111/joa.14231
Harrington, V., et al. (2004). Crystallins in water soluble-high molecular weight protein fractions and water insoluble protein fractions in aging and cataractous human lenses. Molecular Vision, 10, 476–489.
Okamoto, K., Dopkins, N., & Kinfu, E. (2025). A draft genome sequence of the common, or spectacled caiman Caiman crocodilus (Version 2). F1000Research, 10, 1230. https://doi.org/10.12688/f1000research.73066.2
Pacific Biosciences. (n.d.). RNA sequencing. https://www.pacb.com/products-and-services/applications/rna-sequencing/
Pacific Biosciences. (n.d.). Iso-Seq documentation. https://isoseq.how/
Das, G. K., et al. (2019). Presenile cataract and its risk factors: A case control study. Journal of Family Medicine and Primary Care, 8(6), 2120–2123. https://doi.org/10.4103/jfmpc.jfmpc_267_19
Career & Self-Development: This project allowed me to build my skillset both in the wet lab (e.g., micropipetting, running PCRs) and dry lab (e.g., analysis using bioinformatic software). Moreover, it allowed me to apply concepts and techniques that I had learned about in my courses to a real-world setting.
Communication: Communication was key for this project, especially when it came to obtaining clarification regarding next steps. This project helped me learn how to advocate for myself both with my mentor and with representatives from outside organizations (in this case, PacBio).
Technology: I was able to complete this project because of the newly acquired PacBio Vega at the CBFG. It was because we had this novel technology that this project was feasible to complete within a year, and once our full analysis has been run, it should yield more accurate results.