Vrinda Store Analysis

Using PostgreSQL       

In this project, we analyze the Vrinda store data to create an annual sales report for 2022. So that, Vrinda can understand their customers and grow more sales in 2023. 

ABOUT DATA

The Product Category column contains 8 different categories of items sold by Vrinda Store, including Kurta, Set, Western Dress, Top, Saree, Blouse, Ethnic Dress, and Bottom. The Sales Channel column contains 7 different channels used by Vrinda Store to sell their products, including Myntra, Ajio, Amazon, Flipkart, Meesho, Nalli, and Others.

The data also includes shipping details such as City and State, with 1860 different cities and 36 different states listed. This information could be used to analyze Vrinda Store's sales patterns and customer demographics, as well as identify which products and sales channels are the most popular. It could also help in identifying areas where the store could potentially expand its operations or improve its shipping logistics.


DATA CLEANING

Using Excel

Step by step data cleaning process - Watch the following Video.

DATA PROCESSING

Important Tips :

If we have a lot of data and we create new columns by using different formulas, then excel will work slowly so we can copy, and use paste special function and paste as only value in original column.

Step by step data pre-processing - Watch the following Video.

ER DIAGRAM CREATION

How to Create separate Data for each table using original Vrinda store data, please watch following Videos.

DATA ANALYSIS

Using  PostgreSQL

 1) Total sales and Total orders for each date.


Overall, the query will return a result set that shows the total sales and total orders for each date, sorted by the total sales and total orders in descending order.



This is a SQL query that is similar to the previous one, but it adds a "LIMIT" clause at the end to return only the first row of the result set, i.e., the date with the highest total sales and total orders. 


2) Total sales and total number of orders for each gender.

Overall, the query will return a result set that shows the total sales and total number of orders for each gender, sorted by the total sales in descending order.


3) Total number of orders for each order status

Overall, the query will return a result set that shows the total number of orders for each order status.

4) Total number of orders and total sales for each state

Overall, the query will return a result set that shows the total number of orders and total sales for each state, sorted by the total sales in descending order, and limited to the top 10 states.


5) Total sales amount, and age for each age group

Overall, the query will return a result set that shows the total number of customers, total sales amount, and age for each age group, sorted by the sales amount in descending order.

6)  Total sales for each channel

Overall, the query will return a result set that shows the total sales for each channel, sorted by the total sales in descending order.

7) Total sales amount and order count for each product category

Overall, the query will return a result set that shows the total sales amount and order count for each product category, sorted by the sales amount in descending order.

8) Total sales for the top-selling product in each state

Overall, the query will return a result set that shows the total sales for the top-selling product in each state.

9) Average order amount for each age group

Overall, the query will return a result set that shows the average order amount for each age group. The customers are categorized into age ranges using a CASE statement, and the average order amount is calculated and grouped by the corresponding age group.

10) Total number of orders for each combination of channel and whether the order is a B2B order 

11)  Correlation between Age and orders

This query calculates the correlation coefficient between the "amount" column in the "order_details" table and the "age" column in the "customers" table. The correlation coefficient measures the strength and direction of the linear relationship between two variables, and takes a value between -1 and 1. A value of 0 indicates no linear correlation, while a value of 1 or -1 indicates a perfect positive or negative linear correlation, respectively.

The query achieves this by joining the two tables on the "cust_id" column, and then using the "corr" function to calculate the correlation coefficient between the "amount" and "age" columns.


12) Most Popular Product size

This query is selecting the "size" column from the "order_details" table, and then groups the rows by the distinct values of "size". It then counts the number of rows in each group using the COUNT function, and renames the count column as "size_count". Finally, it orders the result set in descending order based on the count of each group and limits the result set to the first row using the LIMIT keyword.

Essentially, the query is trying to find the most commonly ordered size by counting the number of orders for each size and returning the size with the highest count.

13) Most Popular Product size by age group

This query groups orders by the customer's age group and the size of the product ordered. The CASE statement is used to create the age groups based on the customer's age. The result includes three columns: age_group, size, and size_count. The age_group column represents the customer's age group, the size column represents the size of the product ordered, and the size_count column represents the number of products ordered with that size by customers in that age group. The results are ordered by size_count in descending order.

REPORT

Vrinda Store Workbook Here Click Please to view.

INSIGHTS

CONCLUSION

Target women customers of age group (30-49 yrs) living in Maharashtra, Karnataka and Uttar Pradesh by showing ads/offers/coupons available on Amazon, Flipkart and Myntra.

Vrinda Store Analysis PPT

In this PPT, we analyze the Vrinda store data annual sales report for 2022.

Vrinda Store Analysis

Download Material

How to download and use material :

1) Go to the following link.

2) Download and Open Vrinda_Store_Tables_Create.sql file in Postgresql and Run it.

3) Open the data folder download each table data then Add data in each table by using Import/Export option in Postgresql.

4)  Then we perform analysis. (for helping , use Vrinda Analysis Using Postgresql.sql file)


MATERIAL LINK