Statistics in Genomic Research

Recurring prediction/estimation statistics:

External Links:

    • Matthew Stephens' guide to succinct explanations of many relevant statistics concepts

  • Basic online statistics textbook with online calculators

    • John McDonald's handbook for biological statistics

    • Ewan Birney's list of "Five statistical things I wished I had been taught 20 years ago"

  • Learn all about t-SNE

    • Download a FREE copy of Introduction to Statistical Learning here

    • Listen Data's useful guide to the most common 15 types of regression

    • Nature Biotechnology Primers, of specific interest:

      • Analyzing 'omics data using hierarchical models

      • How does multiple testing correction work?

      • How to map billions of short reads onto genomes

      • SNP imputation in association studies

      • Understanding genome browsing

      • What are decision trees?

      • What is the expectation maximization algorithm?

      • What is principal components analysis?

      • How does eukaryotic gene prediction work?

      • What is a support vector machine?

      • What are DNA sequence motifs?

      • How does DNA sequence motif discovery work?

      • How does gene expression clustering work?

      • How do RNA folding algorithms work?

      • What is a hidden Markov model?

      • What is Bayesian statistics?

      • What is dynamic programming?