Motivation to do Track 06
Theme: The organization of values extracted from databases as a probability distribution function of a random variable is a powerful tool for detecting anomalous patterns. In particular, the procedure for obtaining probability distributions makes it easier to identify extremely high or low values, i.e., outliers. This can serve as a basis for carrying out more assertive inspections.
This learning path is designed to develop the following skills:
Examples of random variables.
How to derive the probability distribution of random variables from the probability of events.
Types of continuous random variables probability distributions: Continous Uniform, Exponential distribution, Standard Normal or Gaussian
Inverse standard Normal distribution
Student's T distribution
These concepts will be applied in the following practices with real data:
The distribution of weight, dimension, and value per HS6 code from the OCDB dataset.
How to fit a particular distribution?
Employing standard deviation to do an assessment of outliers as a certain percentage of the population.
Estimation of probability to find fraud as a sum of Bernoulli.
A primer approach to computing total time spent in a system: Exponential distribution application.
Gaussian mixture and its application.
The next links will help in this learning journey. Have a good learning journey.
The journey map of Track 06
Badges
Normal Distrib,
Student T Distrib,
Inverse Distrib.
HS6 and Weight
Fit Distrib,
Detect Deviation
Total Time
Gaussian Mixture
1. Concepts & Definitions
1.1. Continous random distribution of probability
1.2. Normal distribution of probability
1.3. Standard normal distribution of probability
1.4. Inverse standard normal distribution
1.6. Inverse Student's T distribution
2. Problem & Solution
2.1. Weight, dimension, and value per HS6
2.2. How to fit a distribution
2.3. Employing standard deviation
2.4. Total time spent in a system
1. Concepts & Definitions
2. Problem & Solution