Sujets : mini projets
Deadline for submission : Mid-May
Lecture 8
Course 8a : Random Controlled Trials
Course 8a : Course8a.pdf,
Notebook : Lecture8a.ipynb. Dataset : village.csv
RCT and ecometry : E. Duflo
Course 8b : Deal with Observational data
Course 8b : Course8b.pdf
Lecture 8b : Lecture8b.ipynb
Observational data : G. Imbens
Course 8c : Deal with complex graphs
Course 8c : Course8c.pdf
Dataset : hotel_booking.csv
Times series and causality
An example of independence test
Conditional Independence tests
Lab 8
A recent judgement of the European court
Course 7: Interpretability
Lecture 7: Lecture 7.pdf. Tutorials : LIME.ipynb, SHAP.ipynb
Datasets : BostonHousing.csv, telco_data.csv
Lab 7 : Lab7.pdf. Datasets : winequality.csv.
Course 6 Decision Trees, Ensemble methods, Quantile regression
Lecture 6 (Part I): Lecture 6a.pdf.
Lecture 6 (Part II) : Lecture 6b.pdf
Notebook of Lecture 6 (Part I) : Lecture6a.ipynb. Notebook of Lecture 6 (Part II) : Lecture6b.ipynb
Course 5: Beyond OLS
Lecture 5: Lecture 5.pdf.
Lecture 5 notebook : Lecture5.ipynb
Lab 5 : Lab5.pdf. Dataset hitters.csv. Notebook : Lab5.ipynb
Course 4 : Basics of clustering
Lecture 4 : Lecture4.pdf.
Notebook of Lecture 4 : Lecture4.ipynb
More about hierarchical clustering
An improvement of K-means : X means
Details about linkage in Hierarchical Clustering
Silouhette in clustering. Other metrics
The elbow rule with the dataset
The scikit learn website about clustering
Practical Session 4 : PracticalSession4.pdf. Data : Live_20210128.csv. Notebook : Lab4.ipynb
Course 3 : PCA, NMF and t-SNE
Introduction to data visualization with the Seaborn library : IntroductionSeaborn.ipynb
Dataset : iris.csv
Lecture 3 : Lecture3.pdf.
Notebooks of Lecture 3 : pca_breast.ipynb , pca_vs_tsne.ipynb
Dataset : GSE5325_markers.csv
Practical Session 3 : PracticalSession3.pdf
Course 2 : Missing values
Lecture2 : Lecture2.pdf. Notebook of Lecture2 : Lecture2.ipynb.
Course 1 : Basics on Random Variables with Python
First lecture on Random Variables : CM_RandomVariables1.pdf. Notebook of Lecture 1 : Lecture1.ipynb
Practical session on Random Variables : PracticalSession1.pdf. The dataset AirLiquide : DataAirliquide.csv.
Basics on Python
Introduction to Python : IntroductionPython.ipynb
Introduction to the Numpy library: IntroductionNumpy.ipynb
Introduction to the Pandas library : IntroductionPandas.ipynb. The dataset president_heights.csv
Introduction to Python for data science : Tutorial or Other Tutorial