(2025 Jan)
Welcome to the course webpage for DS202: Algorithmic Foundations of Big Data Biology.
The objective of this course is to provide a broad overview of algorithms and data structures for biological sequence analysis. The course concentrates on genomics and related problems such as high-throughput pattern matching of biological sequences, data compression algorithms, gene finding, genome assembly, and phylogenetics. Hands-on programming assignments will give students practical experience with the complexities of real-world data. Tentative topics and prerequisites are listed here. Many algorithmic techniques taught in this course also apply to areas beyond biology, such as text mining, plagiarism checking, web searching, and natural language processing.
Announcements
[12/03] Project presentation and code (on GitHub) due on Apr 11 (Fri)
[12/03] Project presentations to begin on Mar 26 (Wed)
[12/03] Homework-4 posted, due by Mar 24 (Mon)
[27/02] Project preliminary report due by Mar 10 (Mon)
[14/02] Midterm exam syllabus - Week 1 to Week 6
[05/02] Midterm exam is scheduled for Feb 19 (10:10 - 11:40 AM)
[27/01] Homework-3 posted, due by Feb 10 (Mon)
[27/01] Homework-2 posted, due by Feb 10 (Mon)
[13/01] Homework-1 posted, due by Jan 20 (Mon)
[06/01] Homework-0 posted, due by Jan 11 (Sat)
[03/12] First class is scheduled on Jan 06, 2025 (Mon) at 10:00; Venue: CDS 202
Logistics
Class days/time: MWF 10 am - 11 am
Course website: https://sites.google.com/view/ds202
Link to join Teams Course Channel
Class location: CDS 202
Office hours: After class or by appointment
TA: Sudhanva S Kamath
TA office hours: Tuesdays 10:00 - 11:00 am in CDS 325