A recent 2025 project that highlights possible film blockbusters based on historical data within the company.
Company: [Streaming Guide Company - Withheld Name for Anonymity]
Role: Data Analyst
Duration: Ongoing
Tools Used: LightDash, Google Sheets, and Google Slides
The company is a global leading streaming guide that helps you find where to stream any new and popular titles on various platforms.
As part of my Data Analyst role, I created the list and graphics which are then used to leverage journalists to use and write about our data. This assists in getting valuable and relevant backlinks.
The primary goal of the report is to collect data that would allow us to create a prediction on upcoming films that may become blockbusters in a given period of time.
LightDash - Data collection, filtering by country, date range by release date and date range to cover events from a specific period of time. In this case, the upcoming release date range would be for April to June, while the events would be from Jan-Mar
Google Spreadsheets - Where data will be cleaned, and transformed into insights and visualization
Google Slides - For the final visualization and presentation of findings
For this project, I worked on collecting the data. The team wanted to find titles that could become potential blockbuster hits in the months of April to June.
To get these predictions, we collected historical data from the months of Jan-Mar for titles that were to be released for Apr-June to evaluate which titles could become potential blockbuster hits.
With all the available data, I decided that the best way would be to create a normalized weighted score for each title to fairly determine the popularity.
In order to do that, I discussed with the team which metrics would be the most important ranking factors for a potential title. After discussion, I normalized the data on Google Spreadsheets, calculated the weighted score and sorted by the strongest to lowest scores.
Once the Top 10 list was finalized, I created the graphics for the final PR.
Note: This case study is a generalized summary of my workflow and contributions. It does not contain any proprietary data or represent real internal metrics. Company names and details have been withheld or anonymized for confidentiality.