In this module we will give you an overview of exploratory and statistical data analysis methods using R and RStudio. As preparation for this module, have a look at Introduction to R.
My research focuses on understanding the principles of protein dynamics using computational models and simulations. These simulations generate gigabytes to terabytes of data, requiring algorithms that can process large datasets in parallel. Contact me if you are interested in:
Protein/DNA design
Time series data analysis
Unsupervised/supervised learning techniques
Data compression and storage
Topological data analysis methods
Retrieval, analysis, and annotation of genomics/structural databases
CPU/GPU parallelization of any of the above methods
Preferred methods of investigation: Python, R, and Bash in a Linux (Ubuntu) or Linux-like (macOS) environment.
Funding for PhD projects is currently not available. If you are interested in exploring funding opportunities, please email me your CV and a short (~250 words) description of your research interests.