Week 5 Blog Post

Post date: Sep 27, 2017 4:59:2 PM

This week the team did more research into other IQP projects that we can learn from, in addition to the 4 projects that we already found. We added these to the background section talking about the past work at the VPC.

We also did some analysis of the existing data sets, particularly about the CSV data that Nicola gave us. The data stored in the database is split between 14 tables, corresponding to a variety of different topics including churches, city islands, and stores. Across all of the tables, there are nearly 10000 rows of data, and most of the tables have 30 or more columns. An average table consists of about 49000 fields of data, and is nearly half a megabyte in size. The largest table has nearly 5000 lines, over 100000 fields, and is over 2 megabytes large. Unfortunately, a sizable chunk of each of these tables has not been filled in with data. Overall, around 25% of the total fields are either empty or filled with null data. One of the tables is missing nearly 75% of its data.