Click on the button below to access the protocols that we used in the wetlab portion of the workshop. We used the QIAGEN Genomic Tip kit because it usually yields the cleanest and longest DNA, but there are many other HMW DNA kits available and may be better for different needs and sample types.
Click on the buttons below to download the "reads" used for the exercise and the answer.
Directions for the exercise start on slide 17 of Day 1 slides, but I highly encourage you to look at all of the material prior. The goal of this exercise is to understand how coverage, read length, and error rate affect genome assembly.
Phase 1: Short-read assembly
Consider the amount of overlap of the reads and evenness of coverage
Simulation of 4bp reads, no errors
Read orientation: triangles should be pointing up, black square positioned slightly to the left
Linear genome
Phase 2: Hybrid assembly
Consider the amount of overlap of the reads and evenness of coverage
Simulation of 4bp reads + long reads
Blacked out squares in long reads are “errors” and can be any base
We tried to record the lectures, but unfortunately we had technical issues and couldn't record most of them.
Note that Day 3 has some repeat information from Days 1 and 2 because it was hybrid.
If you would like to have PDFs of the slides or use the material please email Lauren (lmlui at lbl dot gov)
Andrew Wilson, Pacific Northwest NationalLaboratory
Lauren Lui, Lawrence Berkeley National Laboratory
Torben Nielsen, Lawrence Berkeley National Laboratory
The bug we found in Flye during the workshop has now been fixed.
John-Marc Chandonia and Lauren Lui, Lawrence Berkeley National Laboratory
Ben Allen, Oak Ridge National Laboratory, KBase
Elisha Wood-Charlson, Lawrence Berkeley National Laboratory, KBase
Carlos Goller, North Carolina State University
Lauren Lui, Lawrence Berkeley National Laboratory