We would like to express our gratitude to Dr. Scott Caddy, Rosa M Norton, Hannah Ellis, and Anooj Kansara for their dedicated teaching and guidance throughout the DIGHUM 100 coursework. Their lessons provided us with the necessary theoretical knowledge of concepts such as Marxism, feminism, and intersectionality, which were invaluable to the formation of our project.
Special thanks to Rosa, our main contact, for her constant support and constructive feedback, which improved the quality of our work.
Additionally, we acknowledge the contributions of various resources that offered essential information and inspiration. We are particularly grateful to Kaggle users @suparnabiswas (rune), @elizabethearhart (Elizabeth Earhart), @imuhammad (Muhammad Nakhaee), @rkibria (Raihan Kibria), and GitHub users @spock0724 (AAyres), @matthieubaillard for providing open-access datasets and/or public projects (links below). Their work provided a valuable foundation for our data analysis and research.
Billboard Hot-100[2000-2023] data with features
Billboard Hot 100 yearly rankings with lyrics (1959-2023)
Billboard Hot 100 weekly rankings without lyrics (1958-2024)
Spotify songs with lyrics (1958-2020)
Gender & genre data of music artists (accessed 2017)
Gender data of music artists (accessed 2023)
Meet the Team
Paul Simon Castro
Hello, my name is Paul Simon Castro, and I am a senior at UC Berkeley majoring in Applied Math and Data Science. I have a keen interest in statistical inference and probability. For this project, I helped with augmenting existing datasets, producing visualizations, and exploring/analyzing trends in gender representation over time.
Brandon Lee Concepcion
Hello! I’m Brandon Lee Concepcion and I am a rising junior at UC Berkeley majoring in Data Science and Computer Science. I am particularly interested in Applied Mathematics, Machine Learning, and Education. For this project, I helped with some exploratory data analysis, In particular, I created visualizations on the similarity of song lyrics to Marxist terms, as well as general sentiment analysis on those lyrics.
Brooklyn Grant
Hi! My name is Brooklyn Grant (goes by Brook for the most part), and I am a rising Senior at UC Berkeley majoring in Legal Studies with a hope for a minor in Digital Humanities. I am passionate about the entertainment industry–with an emphasis on music and how it affects us! For this project, I led web design and helped with the creation of our intersectionality angle and how successful artistry lies within an artist's identity.
Max Huang
Hi, my name is Max Huang. I’m a rising junior at UC Berkeley majoring in Data Science and Linguistics. I like hiking and I recently got into Japanese mahjong. For this project, I mainly helped prepare datasets and led the trend analysis/visualization of selected words in lyrics.
Wiktor Rajca
Hi, I am Wiktor Rajca. I am a rising senior at UC Berkeley majoring in Applied Mathematics and Data Science. I’m interested in econometrics and international politics. For this project, I helped with visualizations, data cleaning, data mining and writing the intersectionality section of the narrative and data critique.
Colin Rondon
Hi! I am Colin Rondon, and I am a rising Junior at UC Berkeley majoring in Data Science, and I am transferring from De Anza College. I am passionate about the intersection of data and entertainment, basketball, and technology. For this project, I helped with the ideation and development of the lyrical analysis with brand-name mentions, as well as researching context for our visualization’s findings and writing sections of related narratives
Jerrae Schroff
I’m Jerrae Schroff. I’m a rising senior at UC Berkeley, double majoring in Computer Science and Data Science, and I take interest in computer and data security. For this project, I’ve helped with writing the project narrative, data critique, and providing feedback on visualizations.