All about data science
Data Scientist
Prove/disprove hypothesis.
Information & Data Gathering
Data debating.
Algorithms & ML Models.
Communication.
Data Engineer
Build Data Driven Platforms.
Operationlize Algorithms & Machine Learning models.
Data Integration.
Monitoring.
Data Visualization
Story Telling
Build Dashboard & Other Data Visualizations.
Provide insight through visual means.
Process Owner
Project Management
Manage StakeHolder Expectations.
Maintain a Vision.
Facilitate.
Evengilize.
require Math & Statistics
is Hypothesis Driven
require a range of competencies/roles.
Supervised Learning
Model trained on historical data. Resulting model can be used to predict to make predictions on new data.
Use Case: Predicting a value based on patterns discovered in previous data.
Regression
Linear Regression
Classification
Decision Trees
Unsupervised Learning
Algorithms finds trends and patterns in data, without prior training on historical data.
Use Case: Describing your data based on statistical analysis.
Clustering
K Means
Reinforcement Learning
Model uses a feedback loop to iteratively improve its performance.
Use Case: Learning how to best solve a problem based on trial and error. Often used in game programming.