The data files are provided by IMDb and were released in December 2017:
IMDb website: https://www.imdb.com/
Data Files: ftp://ftp.fu-berlin.de/pub/misc/movies/database/frozendata/
The data obtained from the Internet Movie Database consisted of many separate files of movie information spanning many years, with a file size greater than 2 GB. The data was parsed from these files and combined into a single CSV file holding all of the movie information to work with. The code showing how this was done is available at the GitHub link.
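As a rough illustration of the parse-and-combine step, here is a minimal Python sketch. It is not the project's actual code (that lives in the GitHub repository). The line layout "Title (Year)<tabs>value", the field names genre and rating, and the sample lines are all assumptions made for the example; the real IMDb list files each have their own layout and header sections.

```python
import csv
import io
import re

# Assumed line layout: "Title (Year)<one or more tabs>value".
# The real list files differ from file to file; adjust the pattern per file.
LINE_RE = re.compile(r"^(?P<title>.+?) \((?P<year>\d{4})\)\t+(?P<value>.+)$")


def parse_list(lines, field):
    """Return {(title, year): {field: value}} for every line that matches."""
    records = {}
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match is None:
            continue  # skip headers, blank lines, and anything unparseable
        key = (match["title"], int(match["year"]))
        # A repeated key overwrites the earlier value in this simple sketch.
        records.setdefault(key, {})[field] = match["value"]
    return records


def combine(*parsed):
    """Merge the per-file records on the shared (title, year) key."""
    merged = {}
    for records in parsed:
        for key, fields in records.items():
            merged.setdefault(key, {}).update(fields)
    return merged


def write_csv(merged, out, fields):
    """Write one row per movie; fields missing for a movie are left empty."""
    writer = csv.DictWriter(
        out, fieldnames=["title", "year", *fields], lineterminator="\n"
    )
    writer.writeheader()
    for (title, year), values in sorted(merged.items()):
        row = {"title": title, "year": year}
        row.update({name: values.get(name, "") for name in fields})
        writer.writerow(row)


# Made-up sample lines standing in for two of the downloaded files.
genres = [
    "Some header line",
    "Example Movie (1999)\t\tDrama",
    "Another Film (2005)\tComedy",
]
ratings = ["Example Movie (1999)\t7.5"]

merged = combine(parse_list(genres, "genre"), parse_list(ratings, "rating"))
buffer = io.StringIO()
write_csv(merged, buffer, ["genre", "rating"])
csv_text = buffer.getvalue()
```

Keying every file on the same (title, year) pair is what lets the separate files collapse into one row per movie. For inputs as large as the ones described here, the same functions can be fed open file handles, which are read line by line, instead of in-memory lists.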
More info on the preprocessing can be found under the Source code + Instructions tab.
During formatting, the data was cleaned of noise and filtered. Columns that were found to be unnecessary were dropped from the final CSV file. Several data frames were then created to organize particular features and support specific functionality in the visualization.
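The cleaning and regrouping described above could look roughly like the following plain-Python sketch. It uses only the standard library and stands in for whatever data-frame tooling the project actually uses. The column names, the rule that a row without a rating counts as noise, and the two summary tables are assumptions chosen for illustration, not the project's real schema.

```python
import csv
import io
from collections import defaultdict

# Made-up combined CSV; "notes" stands in for a column that is not needed.
raw_csv = """title,year,genre,rating,notes
Example Movie,1999,Drama,7.5,scratch
Another Film,2005,Comedy,,scratch
Third Title,2005,Drama,6.0,scratch
"""

KEEP = ["title", "year", "genre", "rating"]  # assumed set of useful columns


def clean(rows):
    """Keep only the KEEP columns, drop rows with no rating, convert types."""
    cleaned = []
    for row in rows:
        kept = {name: row[name].strip() for name in KEEP}
        if not kept["rating"]:
            continue  # assumption: a missing rating is treated as noise
        kept["year"] = int(kept["year"])
        kept["rating"] = float(kept["rating"])
        cleaned.append(kept)
    return cleaned


def mean_rating_by(rows, column):
    """Build a small summary table: mean rating for each value of `column`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[column]].append(row["rating"])
    return {key: sum(values) / len(values) for key, values in groups.items()}


movies = clean(csv.DictReader(io.StringIO(raw_csv)))
by_genre = mean_rating_by(movies, "genre")  # one table for one chart
by_year = mean_rating_by(movies, "year")    # another table for another chart
```

Each summary table plays the role of one of the per-feature data frames: it is derived once from the cleaned rows, so each visualization reads a small precomputed structure rather than rescanning the full CSV.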