Bellabeat is a wellness company founded by Urška Sršen and Sando Mur that develops high-tech devices that monitor biometric and lifestyle data to help women better understand how their bodies work and make healthier choices. Since it's founding in 2013, the company has grown at a fast pace and continued its focus on empowering women about their own health and habits. Their products track and log customer steps, activity, mindfulness, hydration, and sleep helping women create a "balanced lifestyle plan" and reach their personal success.
Bellabeat's products include the Bellabeat App, Ivy Health Tracker, Time watch and tracker, Spring smart water bottle, B.YOU Yoga Mat, as well as other intuitive accessories. Bellabeat's membership provides users with 24/7 access to fully personalized guidance on nutrition, activity, sleep, health and beauty, and mindfulness based on their personal lifestyle and goals. These products are available for purchase on a growing number of online retailers as well as their own website. They have previously invested in traditional advertising media, such as radio, out-of-home billboards, print, television, but focuses on digital marketing extensively. This includes: Google Search, active Facebook and Instagram pages, engaging with consumers on Twitter, and video ads on Youtube and display ads on the Google Display Network.
Focus on one Bellabeat product and analyze smart device usage data in order to gain insight into how consumers are using their smart devices. Then, using this information, make high-level recommendations for Bellabeat's marketing strategy.
What are some trends in smart device usage?
How could these trends apply to Bellabeat customers?
How could these trends help influence Bellabeat marketing strategy?
Urška Sršen: Bellabeat's cofounder and Chief Creative Officer
Sando Mur: Mathematician and Bellabeat's cofounder; key member of the Bellabeat executive team
Bellabeat marketing analytics team: A team of data analysts responsible for collecting, analyzing, and reporting data that helps guide Bellabeat's marketing strategy.
Where it's from & what's in it: The data for this analysis comes from "FitBit Fitness Tracker Data" found on Kaggle. It was generated by respondents to a distributed survey via Amazon Mechanical Turk between March 12, 2016 - May 12, 2016. This dataset contains personal fitness tracker from thirty FitBit users who consented to the submission of their personal tracker data to include: minute-level heart rate, physical activity, and sleep monitoring. It also includes data for daily activity, steps, and heart rate.
Limitations: The data does not include key factors such as user age, gender, location, and lifestyle. Limitations also exist because of the age of the data, the timespan of the data, and the limited amount of users from which the data was collected.
Tools and platforms I'm using: For this analysis I will be using SQL for data cleaning & transforming and Tableau for creating data visualizations.
The dataset was first downloaded and opened as csv files in Excel. The formats for times and dates were corrected to the appropriate "date" and "time" formats in Excel. I then added a column for DayOfWeek using the function "=TEXT(B2,"dddd")" for six of the tables I will be using. Finally, the data was checked for null values and uploaded into BigQuery.
I then ran the queries below to identify the number of users (Id) per table.
These queries revealed that daily, hourly, and by-the-minute data was collected for 33 total users. Meanwhile the total users for the sleep and weight tables only showed data for 24 and 8 users respectively. This led me to not include the sleep table in my data analysis.
The dataset was then checked for inconsistencies using the COUNT, LENGTH, and SUBSTR functions in my queries. This allowed me to search for extraneous duplicates, extra spaces in text strings, and any typos in the data.
I first decided to pull data to check what days of the week users were most active or sedentary by running the following query:
## 1 to check what days of the week have users are most active/sedentary
SELECT DayOfWeek, AVG(TotalSteps) AS TotalSteps_DailyAvg, AVG(VeryActiveMinutes) AS VeryActMin_DailyAvg, AVG(SedentaryMinutes) AS SedMin_DailyAvg
FROM `my-first-project-0122.fitbit_data.dailyActivity`
GROUP BY DayOfWeek;
This then led me to check for what time of day the users were most active. I used this query three different ways, changing the operator and number to match the desired time of day (morning, afternoon, and evening).
## 2 to check what time of the day users are most active (change # at end of 21 for morn,aftn,even)
SELECT Id, COUNT(ActivityHour) AS ActivityCount
FROM `my-first-project-0122.fitbit_data.hourly_intensities`
WHERE TotalIntensity != 0 AND EXTRACT(HOUR FROM ActivityHour) < 12
GROUP BY Id;
Next, I was hoping to check the consistency of the user activity levels as well as to look for any possible patterns that might arise. This query allowed be to focus on the patterns of each user and look for commonalities among them.
## 3 to check the consistency or patterns of user activty levels
SELECT Id, DayOfWeek, AVG(TotalSteps) AS TotalSteps_DailyAvg, AVG(VeryActiveMinutes) AS VeryActMin_DailyAvg, AVG(SedentaryMinutes) AS SedMin_DailyAvg
FROM `my-first-project-0122.fitbit_data.dailyActivity`
GROUP BY Id, DayOfWeek
ORDER BY Id, DayOfWeek;
I then wanted to dive into the sleep patterns of the users and see what data could be useful to users that employ these technologies. Using this query allowed me to pull the required data.
## 4 to check the daily average sleep time of users
SELECT DayOfWeek, AVG(TotalMinutesAsleep) AS TotMinAsl_DailyAvg, AVG(TotalTimeInBed) AS TotTimeInBed_DailyAvg
FROM `my-first-project-0122.fitbit_data.sleepDay`
WHERE TotalSleepRecords = 1
GROUP BY DayOfWeek;
Finally, I wanted to explore the relationship between the users' sleep habits and their activity levels. I ran this query using an Inner Join of two different tables to compare the data and search for insights.
## 5 to compare daily average sleep w/ user daily activity/sedentary from 'dailyActivity' & 'sleepDay' tables
SELECT `my-first-project-0122.fitbit_data.dailyActivity`.Id, `my-first-project-0122.fitbit_data.dailyActivity`.DayOfWeek, AVG(`my-first-project-0122.fitbit_data.dailyActivity`.TotalSteps) AS TotalSteps_DailyAvg, AVG(`my-first-project-0122.fitbit_data.dailyActivity`.VeryActiveMinutes) AS VeryActMin_DailyAvg,
AVG(`my-first-project-0122.fitbit_data.dailyActivity`.SedentaryMinutes) AS SedMin_DailyAvg, AVG(`my-first-project-0122.fitbit_data.sleepDay`.TotalMinutesAsleep) AS TotMInAsl_DailyAvg, AVG(`my-first-project-0122.fitbit_data.sleepDay`.TotalTimeInBed) AS TotTimeInBed_DailyAvg
FROM `my-first-project-0122.fitbit_data.dailyActivity`
JOIN `my-first-project-0122.fitbit_data.sleepDay`
ON `my-first-project-0122.fitbit_data.dailyActivity`.Id = `my-first-project-0122.fitbit_data.sleepDay`.Id
AND `my-first-project-0122.fitbit_data.dailyActivity`.DayofWeek = `my-first-project-0122.fitbit_data.sleepDay`.DayOfWeek
GROUP BY `my-first-project-0122.fitbit_data.dailyActivity`.Id, `my-first-project-0122.fitbit_data.dailyActivity`.DayOfWeek
ORDER BY `my-first-project-0122.fitbit_data.dailyActivity`.Id, `my-first-project-0122.fitbit_data.dailyActivity`.DayOfWeek;
Curating the queries above allowed me to gather some very helpful data and make some important insights. I was able to find patterns in the users' sleep habits, activity levels, and consistency. This allowed me to gain some important insights into how users do and can use these devices to help them reach their personal goals. My next step was to use the data I gathered to create some data visualizations.
These graphs show how active users were on each day of the week by looking at the average total number of steps per day and the average very active minutes they had on each day of the week. The graphs show that users tend to be most active at the beginning of the work week (Monday & Tuesday) then see their activity levels dip throughout the rest of the week. Users also seem to take advantage of their day off on Saturday to get active again.
I then wanted to dive a little deeper into the activity levels of the users throughout the day. The majority of activity tends to occur from 6am-7pm with the highest activity levels happening at lunch time (noon-2pm) and right after work (5-7pm). There does seem to be a group of outliers around 9-10am, which could be a group of unemployed users, nightshift workers, and stay at home parents. This data, along with the data from the charts on the left, could be helpful in deciding when best to send users notifications of their activity levels and motivating reminders.
In these charts, I explored how consistent users were at maintaining their activity levels throughout the data collection time period, which went from 4/12/2016 - 5/12/2016.
The chart shows that many users' sedentary minutes remained relatively consistent throughout. However, users' activity levels tended to vary greatly. This information could help Bellabeat create programs that help users implement healthier life choices.
Next, I wanted to see the amount of sleep users were getting. I quickly noticed that users were getting less than the recommended 7-9hrs of sleep on Thursday-Saturday. While the reasons for this may vary, this information could be highly beneficial to users trying to achieve physical and mental health goals.
Seeing these trends also led me to wanting to see the affect such sleep patterns could be having on the activity levels of the users. I decided to create another chart that would better show this relationship.
These final graphs show the relationship between a user's activity level and their sleep patterns. I used trend lines to help show the patterns that these graphs uncovered. On Monday & Tuesday, when activity levels seem to be at their highest, users had the best sleep to steps ratios. Showing relationships like these could guide users to make decisions that help them reach their goals. This might also help them explain other effects like: stress and anxiety levels, feeling sluggish, and progress towards weight loss.
After analyzing the data I am prepared to make some recommendations to Bellabeat's stakeholders. These recommendations are focused on two Bellabeat products: the Bellabeat Ivy health tracker and the Bellabeat app. The goal of these recommendations is to help the Bellabeat company continue to grow its market and to better serve its current customers.
The sending of notifications, reminders, and progress updates to users to encourage healthy lifestyle choices. This can include lower sedentary times, consistent workouts, healthy eating and hydration habits, and positive sleep behavior.
Sharing of workouts and progress on social media as well as the ability to connect to and communicate with friends, family, and the local Bellabeat community.
Create a marketing campaign that includes new workout and diet programs to help users who struggle to create consistent healthy choices and manage their time. The campaign should also include relationships with popular gyms and fitness companies. I would also recommend advertising with meal plan services. Particularly those with healthy options that potential users might look to for healthy lifestyle changes.
The data provided gave us some great insights into how users use health tracker devices. However, I believe this is only the first step in a much longer journey in the right direction. As Bellabeat continues to grow, more data from their own devices and users will become available. This should include data that limited this analysis: age, occupation, location, eating habits, etc. Proper analysis should provide Bellabeat with a refined plan towards setting and reaching new goals.