Building a Batch Pipeline for NYC Taxi Data with Spark