Search this site
Embedded Files
Skip to main content
Skip to navigation
Data Science
Home
Spark & Scala
Data Science
Big Data
Python
R
Shiny
Swirl
Effective Online Writing
SQL
Machine Learning
Graph mining
Feature Engineering
Data Science
Home
Spark & Scala
Data Science
Big Data
Python
R
Shiny
Swirl
Effective Online Writing
SQL
Machine Learning
Graph mining
Feature Engineering
More
Home
Spark & Scala
Data Science
Big Data
Python
R
Shiny
Swirl
Effective Online Writing
SQL
Machine Learning
Graph mining
Feature Engineering
Data Science
Data Science Methodology
Business and Data Understanding:
Collection of data:
Exploring the data:
Analyzing the data and Deriving the answer:
Storytelling: Visualization and Communication
Taking Actions
Data Sets
Reading Resources
Data Science Methodology
Business and Data Understanding:
What is the problem that you are trying to solve?
How can you use data to answer the question?
What would you do if you had all the data?
Collection of data:
What data do you need to answer the question?
Where is the data coming from and how will you get it?
Which data is relevant?
Is the data that you collected representative of the problem to be solved?
Are there any privacy issues?
Exploring the data:
How does the data look like?
Are there any patterns?
What additional work is required to manipulate and work with the data?
Analyzing the data and Deriving the answer:
How to best build a model?
Do the results make sense?
How well does the model fit?
How can the model be validated?
Does the model used really answer the question or does it need to be adjusted?
Can the model be put into practice?
Storytelling: Visualization and Communication
In what way can the data be visualised to get the answers required?
How can you tell the story?
Can constructive feedback into answering the question be obtained?
Taking Actions
Making decisions
Data Sets
Recommender System
Graph
Reading Resources
Google Sites
Report abuse
Google Sites
Report abuse