Exploratory Data Analysis (EDA) is an approach to analyzing data sets in order to summarise their major characteristics, which commonly involves the use of statistical graphics and other data visualisation techniques. EDA refers to the critical process of perform investigation on data to discover patterns, to spot anomalies, to test hypothesis and to check assumptions with the help of summary statistics and graphical presentation. This step is very important especially when we want modeling the data to apply Machine learning. Plotting in EDA consists of Histograms, Box plot, Scatter plot and many more. Through the process of EDA, we can ask to define the problem statement or definition on our data set which is very important.
The process where EDA is applied in Data Science