🔹Ask
🔹Prepare
🔹Process
🔹Analyze
🔹Share
🔹Act
1. A clear statement of the business task
2. A description of all data sources used
3. Documentation of any cleaning or manipulation of data
4. A summary of my analysis
5. Supporting visualizations and key findings
6. My top three recommendations based on the analysis
Business Task
We want to increase the number of our annual members and in order to do that, we want to target our casual user base. We need to find the differences between how annual and casual riders use the bikes differently. Once we have figured that out, can pinpoint what makes casual users switch.
The Stakeholders
The key stakeholders for this project are:
Director of marketing
Marketing analytics team
Executive team
Data Sources Used
I downloaded the customer trips data from the company's data server and stored appropriately giving a lot of attention to how the files are named and organized. The data was organized into 2 different forlders of .xls and .csv files. Both of these contained Microsoft Excel files. These different formats are needed when doing further analysis. Within the folders there were 12 additional folders containing the data for all the 12 months of 2021.
Processing The Data
ride_id
rideable_type (type of bike)
started_at
ended_at
start_station_name
start_station_id
end_station_name
end_station_id
start_lat (latitude)
start_lng (longitude)
end_lat
end_lng
member_casual (type of member)
ride_length
day_of_week
The Analysis
SELECT MAX(ride_length)
FROM `case-study-2021.case_study_2021.Feb_2021`
SELECT day_of_week
FROM `case-study-2021.case_study_2021.Feb_2021`
GROUP BY day_of_week
ORDER BY COUNT(day_of_week) DESC
LIMIT 1
=AVERAGE(N:N)
(column N has the ride_length data)
=AVERAGEIF(M:M,"member",N:N)
=AVERAGEIF(M:M,"casual",N:N)
(column M has member_casual data)
Sharing My Findings Through Visualizations
This visualization shows that the number of casual riders is the smallest at the beginning of the year i.e. just a few thousand and gradually increases in the coming months until July where it hits a peak monthly ridership of more than four hundred thousand. After that it starts to decline gradually in the coming months until the end of the year.
The story is a little different for annual members. The numbers are the smallest at the beginning of the year like casual riders. After that the numbers go up every month until hitting the peak around Aug-Sep period and then ridership decreases till December. It is important to note that June, July, August are the only months in which number of casual riders is greater than annual members. It means that although the overall number of annual member rides is increasing in the mid year months, the percentage of total monthly rides is decreasing. In order to confirm that, I created the following chart in Microsoft Excel.
This chart makes things much more clearer. Even though both type of riders have the smallest number of rides at the beginning of year, annual member rides constitute ~80% of all rides throughout the month. In the coming months percentage of casual riders increases and annual members' percentage decreases reaching the lowest of ~45% in July. This is in line with the previous findings. After July, annual members' percentage increases till year end.
Now that the analysis is completed, I can draw conclusions and create a set of recommendations for the marketing analytics team.
Recommendations
My top 3 recommendations for the marketing analytics team are:
Create discounted membership plans in the months from June to September for targeting casual riders.
Launch marketing campaigns with stats showing how annual membership is cheaper than casual rides in mid year.
Target the casual riders through social media by geolocating the ones closer to site of any festival, parks, theme parks and tourist attractions because that's where most casual riders would be going on weekends.