CS196-1: Algorithmic Foundations of Computational Biology

 

Semester:

Spring 2008

Schedule:

Tuesdays and Thursdays 2:30-3:50pm (K)

Location:

CIT Swig Boardroom

People:

Office Hours:

  • Prof. Istrail Mon. 3-4p.m.
  • Location CIT 523
  • Long Nguyen Mon. 10-11a.m.
  • Location CIT 527
  • Brendan Hickey Tues. 5-7p.m.
  • Location CIT 244
  • Sanjay Trehan Wed. 2-4p.m.
  • Location CIT 244
  • Rohan Maddamsetti Wed. 2-4p.m.
  • Location CIT 244

CSCI1950-L Timeline

The Adopt A Gene Project Has Been CANCELLED!!!

FRIDAY 05/09/08: Final Exam Given at 10:00a.m. in the SWIG Boardroom

THE FINAL EXAM IS AVAILABLE ON THE HOMEWORK PAGE

MONDAY 05/12/08: Final Exam Due to Erin (room 546) by 10:00a.m.

WEDNESDAY 05/14/08: Final Project Presenations 10:00a.m.-2:00p.m. in the SWIG Boardroom (LUNCH WILL BE SERVED):

Kamran Azam "Parsimony Haplotyping and Phasing"

Lenora Huang "Context-Free HMM and RNA Folding"

Amol Kapilla "Voting Theory and Bioinformatcs"

Dong Kim "Expectation Maximization Haplotyping and Phasing

Jessica Nadel "Trios and TDT Tests and Disease Associations"

Description [PDF]

This course is devoted to computational and statistical methods as well as software tools for DNA, RNA, and protein sequence analysis. The focus is on understanding the algorithmic and mathematical foundations of the methods, the design of the associated genomics tools, as well as on their applications. A comprehensive set of programming assignments provides a hands-on journey for the student into the complexities of real genomic data. These include: basic components of a genome assembler, mapping sets of sequences to the genome, as those generated by high-throughput sequencing like Illumina/Solexa and 454, a BLAST-like search tool, HMM algorithms for gene prediction, suffix trees, motif prediction for transcription factors promoters, and genome mapping of genetic variation of SNPs, haplotypes, and copy number. The course has several unifying themes such as alignment, comparative genomics, protein structure, the newly unveiled role of RNA in the regulatory genome, and the intertwining of statistics and algorithmics in the design of powerful genomic tools.

The course is open to computer and mathematical sciences students as well as biological and medical students. Both advanced undergraduates and graduate students are welcome. Biomed students compensate for programming assignments with comparable work for a final project. Graduate credit is obtained by a final project devoted to a research problem. Two grassroots projects are being built gradually by final projects of students in this class. Genomathica is a library of biologist-friendly-code-tinkering genomic tools written in Mathematica, and Cellarium is a programming language framework for bioinformatics workflows. The instructor taught evolutionary versions of this course in Departments of Biology, Computer Science, and Biochemistry and Cell Biology (in the Medical School).