Machine Learning

1) Machine learning course on Coursera by Andrew Ng (link)

2) UC Irvine Machine Learning dataset repository (link)

3) Introduction to Statistical Learning (ISLR) book and R code (link) (book PDF) (R code) (videos) (playlist) (more advanced book)

Very good books on basic statistics (link)

p-value (video)

effect size (link)

4) Great explanation of the bias-variance tradeoff (link)

ISLR video explanation (link)

https://www.youtube.com/watch?v=VaN1RUDuioQ&list=PLOg0ngHtcqbPTlZzRHA2ocQZqB1D_qZ5V&index=5

http://scott.fortmann-roe.com/docs/BiasVariance.html

VERY GOOD picture

https://github.com/neelsoumya/basic_statistics/blob/master/bias_variance.png

5) Basic statistics (ANOVA, t-test, F-test, etc) (link)

Linear models, ANOVA, mixed effects, fixed effects, random effects and other basics (link) (tutorial 1) (tutorial 2)

Beautiful VERY GOOD tutorial on how most statistical tests are related to linear models (link)

Coursera course on basic statistics (link, github)

6) MIT OCW course on Artificial Intelligence by Patrick Winston (link) (course webpage)

search part 4 video search = choice

very good lecture on neural networks, autoencoders and softmax (link)

goal trees and expert systems (link)

7) Area under curve (AUC) and ROC curve explanation (link) (video)

Precision-recall curve (link) (link)

VERY GOOD Video tutorial based on ISLR material by Trevor Hastie and Rob Tibshirani (link)

VERY GOOD picture of precision, recall, confusion matrix, false positive, true positive, etc (link)

The number AUC has a probabilistic interpretation.

It is the probability that a randomly chosen positive example is ranked more highly than a randomly chosen negative example (link)
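The ranking interpretation above can be checked directly on a small example. The sketch below is only an illustration: the function name `auc_by_ranking` and the toy scores and labels are made up here, and ties are counted as half a win, which is a common convention rather than something stated in the linked material.

```python
import numpy as np

def auc_by_ranking(scores, labels):
    """Fraction of (positive, negative) pairs in which the positive example
    gets the higher score; tied pairs count as half a win (assumed convention)."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    pos = scores[labels]
    neg = scores[~labels]
    # Compare every positive score against every negative score.
    diff = pos[:, None] - neg[None, :]
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return wins / (pos.size * neg.size)

# Hypothetical toy data: 3 positives, 3 negatives.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
auc = auc_by_ranking(scores, labels)
print(auc)  # 8 of the 9 positive/negative pairs are ranked correctly
```

Here only the pair (0.4, 0.7) is ranked the wrong way round, so the value is 8/9.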

Sensitivity and specificity

https://github.com/neelsoumya/basic_statistics/blob/master/800px-Sensitivity_and_specificity.svg.png

Explanation of AUC (area under curve): once the model is selected, you can vary the classification threshold applied to the logistic regression predictions (picture courtesy Chris Penfold)

https://github.com/neelsoumya/basic_statistics/blob/master/auc_explanation.png
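To make the effect of the threshold concrete, here is a minimal sketch that computes sensitivity and specificity at a few thresholds. The helper name `sensitivity_specificity`, the toy scores and the chosen thresholds are illustrative assumptions; the scores stand in for predicted probabilities from a fitted model such as logistic regression.

```python
import numpy as np

def sensitivity_specificity(scores, labels, threshold):
    """Classify as positive when score >= threshold, then return
    (sensitivity, specificity) from the confusion-matrix counts."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    predicted = scores >= threshold
    tp = np.sum(predicted & labels)      # true positives
    fn = np.sum(~predicted & labels)     # false negatives
    tn = np.sum(~predicted & ~labels)    # true negatives
    fp = np.sum(predicted & ~labels)     # false positives
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical predicted probabilities and true labels.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]

for t in (0.25, 0.5, 0.75):
    sens, spec = sensitivity_specificity(scores, labels, t)
    print(f"threshold={t:.2f}  sensitivity={sens:.3f}  specificity={spec:.3f}")
```

Lowering the threshold raises sensitivity at the cost of specificity. Each threshold corresponds to one point on the ROC curve, plotted as (1 - specificity, sensitivity).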

8) Techniques for visualizing and thinking about higher dimensions

How to use tSNE (link)

Explaining and exploring tSNE visually (link)

Great explanation of PCA (principal components analysis) (link) (link)

tSNE and PCA in your browser (link)

Eigenvectors and basis (link)

Uniform distribution in high dimensions (link)

Difference between UMAP and t-SNE (link) (link)

Coursera course on mathematics of PCA, dot product (link)

NCERT textbook on matrix and determinants (link) (link)

Mathematics of machine learning book by Marc Deisenroth (link)

9) Linear algebra basics

Determinant (link)

Eigenvectors and basis (link)

Linear algebra basics course MIT OCW (link)

NCERT textbook on matrix and determinants (link) (link)

Mathematics of machine learning book by Marc Deisenroth (link)

10) About the Wishart distribution (conjugate prior for the precision matrix of a multivariate normal distribution) (link)

The Wishart distribution is often used as a model for the distribution of the sample covariance matrix of multivariate normal random data, after scaling by the sample size. If x is a bivariate normal random vector with mean zero and covariance matrix Sigma, then you can use the Wishart distribution to generate a sample covariance matrix without explicitly generating x itself. Notice how the sampling variability is quite large when the degrees of freedom (df) are small.

Sigma = [1 .5; .5 2];
df = 10;
S1 = wishrnd(Sigma,df)/df

S1 =
    1.7959    0.64107
    0.64107   1.5496

df = 1000;
S2 = wishrnd(Sigma,df)/df

S2 =
    0.9842    0.50158
    0.50158   2.1682
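For comparison, the same experiment can be sketched in Python by taking the explicit route that the Wishart shortcut avoids: draw df zero-mean normal vectors and form X'X / df, which is one scaled Wishart(Sigma, df) draw. The function name `sample_covariance_via_normal` and the seed are arbitrary choices for this illustration, and the numbers will not match the MATLAB output above because the random streams differ.

```python
import numpy as np

def sample_covariance_via_normal(Sigma, df, rng):
    """Draw df zero-mean normal vectors with covariance Sigma.
    X.T @ X is then one Wishart(Sigma, df) draw; divide by df to scale."""
    X = rng.multivariate_normal(np.zeros(Sigma.shape[0]), Sigma, size=df)
    return X.T @ X / df

rng = np.random.default_rng(0)  # arbitrary seed, for reproducibility only
Sigma = np.array([[1.0, 0.5],
                  [0.5, 2.0]])

S1 = sample_covariance_via_normal(Sigma, 10, rng)    # few degrees of freedom
S2 = sample_covariance_via_normal(Sigma, 1000, rng)  # many degrees of freedom

print("df = 10:\n", S1)
print("df = 1000:\n", S2)
```

With df = 1000 the result should sit close to Sigma, while the df = 10 draw can be far off, which is the sampling variability the text points out.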

11) Multivariate normal distribution (from link)