Superstore Profitability Analysis
End-to-End Retail Analytics
End-to-End Retail Analytics
Retail businesses often experience strong revenue growth while profitability remains uneven across categories and regions.
This project investigates profitability drivers and margin risks within the Superstore retail dataset to identify actionable improvements.
Superstore transactional dataset containing:
Orders and sales transactions
Product categories and sub-categories
Regional and state-level sales performance
Revenue, profit, and quantity metrics
Data cleaning and preparation were performed in Excel using Power Query.
Key steps:
removing inconsistencies in raw data
validating revenue and profit calculations
creating structured tables for analysis
deriving analytical metrics used across the workflow
SQL was used to validate aggregated metrics and perform advanced analytical queries.
Analyses included:
segment contribution to total sales and profit
ranking products by profitability within categories
identifying top performing products
Pareto analysis to detect profit concentration across products
These queries ensured analytical accuracy before further exploration.
Python was used to explore patterns in the dataset using statistical and visual analysis.
EDA included:
profit distribution analysis
category-level performance comparison
identification of outliers and loss-making segments
hypothesis testing around profitability drivers
An executive-focused Power BI dashboard was created to visualize key findings and support strategic decision making.
Dashboard components include:
overall revenue, profit, and quantity KPIs
sales and profit trends over time
category and sub-category profit contribution
regional and state profitability analysis
Technology and Office Supplies drive the majority of total profits
Furniture generates significant revenue but consistently underperforms in margins
Central and South regions show structural profitability weaknesses
Profit risk increases when low-margin categories concentrate in weak regions
prioritize high-margin product categories such as Technology
review pricing strategy in underperforming regions
restructure or optimize low-margin product lines
rebalance category distribution across regions
This project demonstrates a full analytics workflow across multiple tools to diagnose profitability drivers in the Superstore retail dataset.
The analysis pipeline includes:
Excel — Data cleaning, validation, and pivot-based exploratory analysis
SQL — Advanced analytical queries to validate metrics and perform ranking and Pareto analysis
Python — Exploratory data analysis and hypothesis testing to investigate profitability patterns
Power BI — Executive-level dashboard summarizing insights and supporting business decisions
Each stage of the workflow is documented in separate repositories to demonstrate proficiency across different analytics tools.