Introduction:
Definition,
Scope of Data Science,
Softwares required for Data Science,
Introduction of Python.
Descriptive Statistics with Python:
Data Preparation,
Exploratory Data Analysis,
Estimation of different statistics,
Statistical Inference with Python:
The Frequentist Approach,
Measuring the Variability in Estimates,
Point and Interval Estimates,
Hypothesis Testing.
Regression Analysis with Python:
Linear Regression,
Sparse Model,
Logistic Regression.
Supervised Learning:
Learning Curves,
Training, Validation and Test,
Generalities Concerning Learning Models,
Support Vector Machines,
Random Forest.
Unsupervised Learning:
Similarity and Distances,
Rand Index, Homogeneity,
Completeness and V-measure Scores,
Silhouette Score
Taxonomies of Clustering Techniques:
K-means Clustering, Spectral Clustering,
Hierarchical Clustering, Adding Connectivity Constraints,
Comparison of Different Hard Partition
Clustering Algorithms Network Analysis:
Basic Definitions in Graphs,
Social Network Analysis, Centrality,
Ego-Networks, Community Detection.
Statistical Natural Language Processing for Sentiment Analysis:
Data Cleaning,
Text Representation,
Bi-Grams and n-Grams.