This lesson only covers Big Data. At the end of the lesson, students should be able to
Describe briefly about Big Data
Explain how big data is implemented in organizations.
Describe the potential careers in Big Data
The purpose of this section is just to give you an overview of the two tools in big data which are Hadoop and MapReduce. So, what is Hadoop? Based on Edureka website, Hadoop is a framework that allows you to first store Big Data in a distributed environment, so that, you can process it parallely. There are basically two components in Hadoop which are HDFS and YAN Processing.
A good source to read about Hadoop is in Edureka blog https://www.edureka.co/blog/what-is-hadoop/
This video mentioned about MapReduce. MapReduce is part of the Hadoop framework. This video also mentioned about the data is processed in parallel. Read more about MapReduce in Edureka. https://www.edureka.co/blog/mapreduce-tutorial/
Write a summary of your understanding on the tools in Big data. Take note of the terms such as Hadoop, MapReduce, HDFS, Parallel Processing that is relevant to Big data tools.