Day 1

Essential Statistic for Data Science

How is data collected:-

  • Primary data
  • Secondary Data

Data Type:-

      • Whole number
      • Got by counting the values
      • Eg: 8, 10, 12, 231
      • Decimal number
      • Got by measuring with something.
      • Eg: 3.2, 5.8
      • Have 2 values
      • Eg: Yes/No, True/false, Male/Female
      • More than 2 values
      • Eg: Marital status 
      • ordered category
      • Eg: rating 1-5 satisfaction 

Types of Data Analytics

Machine Learning

AI, Machine Learning, Deep Learning

Artificial Intelligence (AI) - Machines behave like human

Machine Learning (ML) - How to make the machine behave like human

Deep Learning (DL) - Make machine learn, understand and respond.

Steps in Machine Learning Process

    • Remove missing values
    • Identify outliers
  1. Visualize the data
    • Convert data into chart/graph
    • Descriptive & Dianostic

Statistics for Data Analytics

Types of Statistics

    • Describe basic statistical values
    • Eg: Mean, mode, median, range, variance, standard deviation
    • Hypothesis Testings/ assumptions
      1. Null Hypothesis- No statistical difference between groups/not enough evidence
      2. Alternative Hypothesis- Statistical difference between group
    • Techniques
      1. Min-max Scalling
      2. Standardizaion