We first used the Songkick API to gather all of the artists that played in New York City on the Night of November 15th, about 60 total, which we put into a csv. Using string concatenation we were able to loop through all of the artists with a for loop, passing the LastFM API request for top tracks of each artist to the object "r."
Our intention was to find data sets with voice activated streaming information. We had little success as this data is still very new so we created our own data sets to compare through obtaining API keys from our chosen sources.
We created scripts in Python to pull down information through the APIs to output into json and xml files. We then created new Python scripts to parse through the json and xml data and create CSV files. All scripts are available in the GitHub repository.
With our CSV files of New York City venues, top streamed tracks, and top streamed artists, we altered our scripts with specific parameters to narrow down our results to a specific month or day. These instructions are within the code. We then extracted what data we needed for mapping our results – for instance our scripts directly pulled latitude and longitude coordinates so this made for an easy mapping transition. We also manually cleaned some of the data in Sarah’s result XML (artists and their top tracks) and inserted a column into the Songkick artist csv with the artist's top tracks and number of listeners.
After importing the CSV files into Google MyMaps we had a fun time looking at live NYC shows compared to what the rest of the country is streaming.