DS 202: Algorithmic Foundations of Big Data Biology 

(January 2024)

Welcome to the course webpage for DS202: Algorithmic Foundations of Big Data Biology.


The objective of this course is to provide a broad overview of algorithms and data structures for biological sequence analysis. There will be a strong focus on genomics and related problems such as high-throughput pattern matching of biological sequences, data compression algorithms, gene finding, genome assembly and phylogenetics. Hands-on programming assignments will help students appreciate the complexities of real-world data. Tentative topics and prerequisites are listed here. Many algorithmic techniques taught in this course are also applicable to areas beyond biology, such as text mining, plagiarism checking, web searching and natural language processing.

 Announcements

 Logistics