Below are brief descriptions for the research topics in which I have been or currently am interested. Please visit the multimedia page for presentations and posters on these topics. Feel free to contact me (Daniel.Cruz[at]medicine.ufl.edu) if you have any questions or comments!
You can view my list of publications on ORCID (ID: 0000-0001-5832-5228), Google Scholar, or at the bottom of this page (along with manuscripts in progress).
To realize the promise of personalized medicine, modelers need to be able to integrate different types of data collected from a patient into a computational framework that enables decision-making about optimal interventions to help the patient to either maintain or regain health. A fundamental problem that currently does not have a widely applicable solution is how to calibrate a generic computational model of human biology to a given patient at a given time. In joint work with members of the Laboratory for Systems Medicine and the Mehrad Lab at the University of Florida, I am working on data assimilation algorithms to address this issue.
In a recent manuscript [Knapp et al 2025], my coauthors and I present an algorithm to address the problem above for the agent-based model framework, commonly used to capture stochastic and spatially heterogeneous biological processes, such as tumor growth or immune system dynamics. This algorithm addresses the key challenge of bridging the gap between the clinically measurable quantities (the macro-state) and the fine-grained data at different physiological scales which are required to run the model (the micro-state).
Overview of data assimilation algorithm for agent-based models [Knapp et al 2025].
(Top) Immunofluorescence image taken by Dr. Eunbi Park. (Bottom) Overview for microscopy image analysis pipeline [Hartsock et al 2025].
The differentiation of stem cell colonies into specified tissue types is possible through local and long-distance intercellular communication; however, it is unclear which local mechanisms take priority in context-specific situations as the fates of cell colonies are established. In joint work with Elena Dimitrova, Melissa Kemp, Eunbi Park, and Jack Toppen, we consider human induced pluripotent stem cells (hiPSCs) whose therapeutic potential arises from their ability to differentiate into all germ layers. Prior work in the literature [Guye et al 2016] suggests that both cell-autonomous and non-autonomous (e.g. positional) mechanisms determine cell fate during the differentiation of hiSPCs, producing patterns and other system-level features in the process.
Informed by experimental data, we have focused our work on three interrelated projects:
The development of a general-use, computational pipeline to quantitatively examine the multicellular organization and pattern formation of hiPSC colonies using topological data analysis [Hartsock et al 2025].
The development of an agent-based model (ABM) of early hiPSC differentiation in which agents (i.e. cells) are equipped with a Boolean network model of a target pathway serving as a potential mechanism for intercellular communication [Park et al 2025, in preparation].
The generalization of a mathematical framework [Yereniuk & Olson 2019] which formalizes ABMs for the purpose of estimating long-term behavior of biologically-inspired ABMs without simulations, specifically changes in agent population densities over time [Cruz et al 2025, in preparation].
With the above, we aim to ascertain which modes of intercellular communication determine cell fate by studying both local interactions and emergent colony behaviors.
A double occurrence word (DOW) is a word (i.e., a sequence of symbols) in which every symbol appears exactly twice. In some instances, DNA rearrangement processes can be modeled using DOWs. It was discovered that over 95% of the scrambled genome of Oxytricha trifallax could be described using the so called repeat (aa) and return (aa^R) patterns, with gaps allowed between the a's [Burns et al 2016]. A pattern is said to "appear" in a DOW w if a can be uniquely associated with a sequence of distinct symbols in w.
The proposed unscrambling process in [Burns et al 2016] involves removing repeat/return words from a given DOW one at a time; this removal process was studied formally in [Jonoska et al 2017]. In order to understand how so much scrambling may have arisen within the genome of Oxytricha trifallax, my collaborators and I studied the insertion of repeat and return words into DOWs in [Cruz et al 2020] and characterized the structure of a given DOW w which allows two distinct insertions to yield equivalent DOWs. The characterization introduces a method to generate families of words recursively.
Figure visualizing the DNA rearrangement process for Oxytricha trifallax [Jonoska et al 2017].
Experimental design for SIMAs using DNA strand displacement to allow floating tiles to attach to a predesigned array (i.e. platform). Image obtained from [Jonoska & Seeman 2015].
From the molecular to the microbiological scale and beyond, the organization of elementary "building blocks" may not only be influenced by local interactions but by environmental changes and other external controls. A system of interactive molecular arrangement (SIMA) is a model which utilizes the aforementioned dynamics and is based on experimental design [Jonoska & Seeman 2015]. In this model, two types (species) of floating molecular (DNA) tiles attach to a predesigned array in an alternating manner according to an external control. The array consists of neighboring tiles which "communicate" and which are complementary to the floating tiles. The synchronous attachment of a single species of floating tiles makes the SIMA model similar to the cellular automaton (CA) model. My collaborators and I formally described the SIMA model in [Braun et al 2017] and explored some of its computational capabilities, but several questions remain unanswered.
A set of elementary building blocks undergoes self-assembly if local interactions govern how this set forms intricate structures. Self-assembly has been widely observed in nature, ranging from the field of crystallography to the study of viruses and multicellular organisms. A natural question is whether a model of self-assembly can capture the hierarchical growth seen in nature or in other fields of mathematics. In my dissertation, I considered hierarchical growth in substitution rules. Informally, a substitution rule describes the iterated process by which the polygons of a given set are individually enlarged and dissected; see [Goodman-Strauss 1998] for more information.
To study this topic, I developed the Polygonal Two-Handed Assembly Model (p-2HAM) where building blocks, or tiles, are polygons and growth occurs when tiles bind to one another via matching, complementary bonds on adjacent sides; the resulting assemblies can then be used to construct new, larger structures. The p-2HAM is based on a handful of well-studied models, notably the Two-Handed Assembly Model and the polygonal free-body Tile Assembly Model; see [Patitz 2014] for more information. The primary focus of my dissertation [Cruz 2019] was to provide conditions which are either necessary or sufficient for the "bordered simulation" substitution rules. This kind of "simulation" involves a border made up of tiles which forms around an assembly which then coordinates how the assembly interacts with other assemblies. See the example in the image below based on the non-pinwheel substitution rule (top of the image). I am still interested in finding a necessary and sufficient condition which generalizes my dissertation work.
A.C. Knapp*, D.A. Cruz*, B. Mehrad, R.C. Laubenbacher. Personalizing computational models to construct medical digital twins. Journal of the Royal Society Interface 22, 20250055 (2025). DOI: 10.1098/rsif.2025.0055
I. Hartsock*, E. Park*, J. Toppen, E.S. Dimitrova, M.L. Kemp, P. Bubenik, D.A. Cruz. Topological data analysis of pattern formation of human induced pluripotent stem cell colonies. Scientific Reports 15, 11544 (2025). DOI: 10.1038/s41598-025-90592-1
D.A. Cruz*, M.L Kemp*. Hybrid computational modeling methods for systems biology. Progress in Biomedical Engineering 4 (2021) 012002. DOI: 10.1088/2516-1091/ac2cdf
D.A. Cruz*, M.M. Ferrari*, N. Jonoska*, L. Nabergall*, M. Saito*. Insertions yielding equivalent double occurrence words. Fundamenta Informaticae 171: 1–4, (2020) 113–132. DOI: 10.3233/FI-2020-1875
J. Braun*, D. Cruz*, N. Jonoska*. Platform color designs for interactive molecular arrangements. Unconventional Computation and Natural Computation LNCS (M.J. Patitz, M. Stannett, eds) 10240, (2017) 69-81. DOI: 10.1007/978-3-319-58187-3_6
D.A. Cruz, J. Toppen, E. Park, M.L. Kemp, E.S. Dimitrova. Estimating the long-term behavior of biologically inspired agent-based models. In preparation (2025); partial preprint: https://arxiv.org/abs/2211.00630.
L. Sordo Vieira*, D.A. Cruz*. Towards complement-based therapeutics: opportunities for mathematical modeling. In preparation (2025); full preprint available upon request.
E. Park*, D.A. Cruz*, J. Toppen, E.S. Dimitrova, M.L. Kemp. Extraction of transcriptional network within human induced pluripotent colonies via Boolean logic. In preparation (2025).