Unveiling the Android App Market: Analyzing Google Play Store Data
Unveiling the Android App Market: Analyzing Google Play Store Data
This project aimed to analyze Google Play Store data to understand app market dynamics. It involved cleaning, categorizing, and visualizing data to gain insights into app performance, sentiment analysis, and market trends using data analytics and visualization tools in Power BI.
Apps Dataset (apps.csv): Provided app information, including category, rating, reviews, size, installs, type (free/paid), price, content rating, genres, and version details.
User Reviews Dataset (user_reviews.csv): Included user reviews, sentiment analysis, sentiment polarity, and subjectivity scores.
Data Collection: Extracted from Kaggle and imported to Power BI.
Data Wrangling:
Apps Handling
Removed duplicates, null values, and inappropriate rows.
Corrected data types and categorized android versions.
Splitted columns by delimiters for better readability.
Trimmed and formatted the data for consistency.
User Reviews Handling:
Cleaned and formatted the review data.
Removed null values and leading spaces.
Merged the two datasets using a common column (App name).
Created new measures using DAX for advanced analysis:
Sentiment Counts & Percentages: To quantify positive, negative, and neutral sentiments.
Average Rating & Reviews: Evaluated overall app performance.
Most Popular Android Version & Installed Apps: Identified market leaders.
2018 Installations: Validated insights against external reports.
Correlation Analysis:
Evaluated relationships between:
App Size & Rating
Installations & Reviews
Used KPIs for average rating, reviews, and popular Android versions.
Integrated slicers for dynamic data interaction.
Designed the dashboard with clean and interactive elements, including icons and clickable navigation.
DashBoard 1
Dashboard 2
Average Rating and Reviews:
The overall average app rating is 4.16, indicating generally positive user satisfaction.
The 'Clash of Clans' app received the highest average reviews (45M), significantly outperforming other apps.
Category Analysis:
Events apps lead in average ratings (4.5), followed closely by Education, Art & Design, and Parenting categories (4.4 each).
Arcade genre has the highest average installs (19.9M), with Racing (14.6M) and Casual (12M) genres also performing well.
Sentiment Analysis:
Health and Fitness apps show the highest positive sentiment (80%), suggesting strong user satisfaction.
Medical apps exhibit the highest negative sentiment (16%) among the top 10 categories with positive sentiment, indicating slight user dissatisfaction with deliveries and services.
Price and Installation Correlation:
There is no significant correlation between price and average installations, implying price does not influence the number of app downloads.
Higher installations of free apps do not necessarily mean that installation is influenced by the app type (free or paid); instead, this reflects a higher number of free apps available on the platform.
Size and Performance Correlation:
A weak correlation (0.06) exists between app size and average rating, showing that app size does not significantly affect user ratings.
Similarly, app size has minimal impact on average installations, despite a slight increase in installations with larger app sizes.
The analysis of the Google Play Store data provided valuable insights into the dynamics of the Android app market. By cleaning, categorizing, and visualizing the data, I was able to identify key trends and draw meaningful conclusions.
App Size and Performance: Although there was slight relationship between App size and rating but there is no positive correlation suggesting that Apps size has little or no influence on App performance.
Influence of Price on Installs: Price has minimal impact on the number of installations. Free apps dominate the market, highlighting that users value accessibility and quality over cost.
Engagement Through Reviews: Although there is a slight relationship between installations and reviews, it is not strong, indicating that while higher installations may drive engagement, other factors such as app quality and user experience also play significant roles.
Top-Performing Apps: The analysis confirmed that Google News was the most downloaded app in 2018, aligning with external sources like Mind Blogs. This reinforces the credibility of the analysis and showcases the effectiveness of data-driven insights.
Sentiment Analysis: The sentiment analysis provided insights into user satisfaction, with positive sentiment being dominant across app categories. This helps app developers and marketers understand user preferences and improve app features accordingly.
Overall, this project not only demonstrated the power of data analytics and visualization tools like Power BI but also emphasized the importance of data-driven decision-making in the tech industry. The ability to transform raw data into actionable insights is crucial for understanding market dynamics and staying competitive in the rapidly evolving app ecosystem.