Khue Nam Do
Aspiring business analyst - Data Science Class of 28'
Khue architected our news media pipeline, writing custom web scrapers to pull articles and mentions from outlets across the country and then cleaning and unifying those diverse sources into a single dataset. She sourced additional datasets on media volume and tone, engineered meaningful metrics—such as daily coverage counts and sentiment scores—and translated those insights into clear, interactive Tableau visualizations that demonstrate how mainstream attention aligns with spikes in fan engagement. Khue also wove these findings into our narrative page, crafting explanatory text that guides readers through the data critique and highlights the relationship between traditional press coverage and social media momentum.
Keane Davis
Keane specializes in data visualization and database management. They built our dashboards and charts, turning raw social media streams into clear, interactive insights. They overlaid attendance vs. media-mention trends in a single graph to show how coverage drives fan turnout and added a global text-size slider so users can adjust all dashboard text for different visual needs
Shuvam Chatterjee
With a background in statistics and a passion for sports analytics, Shuvam led our data scraping efforts, through gathering online data such as Google searches and WNBA historical events. He used this to build the interactive timeline and the search data visualization. He also wrote for the Data Critique page.
Emma Do
Aspiring software engineer student - Data Science major, CS minor Class of 27'
Emma led the end to end design and development of our site’s core pages. Beyond crafting a clean layout and selecting a unified color palette, she scraped Google Trends metrics to assembling attendance and viewership logs. She embedded visualizations into the narrative, and wrote detailed annotations for every chart to guide readers through our data critique. Besides, she created the Public Code page to ensure clarify and honesty in the data analysis process. She ensured that all page themes aligned perfectly, optimized content flow for accessibility and responsiveness, and maintained consistent styling across all sections.
Andrew Madrigal
Developed the Bibliography page with a searchable, categorized reference list. He implemented collapsible sections organized by resource type, ensured consistent APA/MLA citation formatting, and configured external links to open in new tabs for seamless user experience. Similarly updated the Data Critique to account for both important datasets.
Dylan Butler
Designed the Narrative & Data Critique page. Assisted with implementation of StreamLit data visualizations with embedded code. Built toggles for methodological notes and reflections, and integrated accessibility features across the website.
Every team member contributed to writing the narrative, interpreting findings, and weaving together data-driven stories with human insights. In the closing sections, every team member contributed reflections on practical implications, recommendations for league stakeholders, and personal takeaways from the project.
We established weekly writing sprints where team members pitched story angles, aligned on tone, and peer-reviewed drafts to ensure clarity and cohesion. Through this structured yet flexible approach, our narrative bridges data and human experience, inviting readers to engage with both the numbers and the stories that shape the WNBA’s digital presence.
Home page: Emma Do
Narrative page: Khue Nam Do, Shuvam Chatterjee, Dylan Butler, Emma Do
Data Critique page: Khue Nam Do, Emma Do, Shuvam Chatterjee, Andrew Madrigal
Public Code page: Emma Do, Khue Nam Do
Annotated Bibliography page: Dylan Butler, Khue Nam Do, Emma Do, Shuvam Chatterjee
About page: Emma Do
Our project’s architecture is shaped by a series of deliberate technology and data‐processing choices, each selected to maximize reproducibility, collaboration, and accessibility. We anchored all data ingestion, cleaning, and sentiment‐analysis workflows in Python within Google Colab, web scraping via the MediaCloud API to create our own dataset of media headlines from 1995 to 2025 relating to the WNBA. We utilized keyword-based tagging (specifically looking for controversies, milestones, player movement, sponsorships and other non-specific headlines), sentiment analysis, topical focus and publication frequency to understand how the WNBA is represented.
For our visualizations, we opted to build interactive dashboards in Tableau; its drag-and-drop interface lets users dynamically filter by date, platform, and sentiment, which are functionality that static Python charts alone could not easily provide . On the front end, we developed a Streamlit application to embed our Python analyses, specifically integrating accessibility features such as adjustable text size, keyboard navigation, and alt-text for every chart. These are decisions directly informed by data-feminist and accessibility theories . Finally, we chose Google Sites as our hosting platform to seamlessly embed both Tableau dashboards and the Streamlit app without additional server infrastructure, ensuring that updates to our analysis notebooks and visual components appear in real time on the live site. These choices are most evident in the “Interactive Dashboards & Resources” section of our site, where you can inspect the Colab notebooks, explore live dashboard embeds, and review annotated code snippets from end to end.
This project wasn't made available if it was not with the help of these resources below:
"Unraveling the Influence of Wealth on Voting Behavior" by DIGHUM 100 Final Project Group 6-6
Link: https://sites.google.com/berkeley.edu/wealthandvoting/narrative?authuser=0
How To Create A Free Website With Google Sites by Stewart Gauld
Link: https://www.youtube.com/watch?v=NN6h7bTRMzw
Emma Do: quyendo2004@berkeley.edu
Khue Nam Do: khue_do@berkeley.edu
Shuvam Chatterjee: shuvamc@berkeley.edu
Dylan Butler: dylanbutler@berkeley.edu
Keane Davis: keaned@berkeley.edu
Andrew Madrigal: andrew_madrigal@berkeley.edu