Big Data Architecture

The four Vs of Big Data are:

Volume – the amount of data being generated and stored

Variety – the different formats of data (structured, semi-structured and unstructured)

Velocity – the ever-increasing speed at which data is generated

Veracity – the accuracy and trustworthiness of the available data

The Hadoop ecosystem offers a wide range of technologies that help solve complex business problems.

The core components of Hadoop are:

1) Hadoop Common

2) HDFS

3) Hadoop MapReduce

4) YARN
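
To make the MapReduce component above more concrete, here is a minimal single-process sketch of the map, shuffle and reduce steps in plain Python, using word count as the example. It does not use the Hadoop API; the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical and chosen only for illustration. On a real cluster the input would be read from HDFS and the tasks scheduled by YARN across many nodes.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group emitted values by key, the step a MapReduce
    framework performs between the map and reduce phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map -> shuffle -> reduce in one process (illustration only)."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data big cluster", "data node"]
    print(word_count(sample))
```

Because each mapper call looks at only one line and each reducer call at only one key, both phases can in principle run in parallel on different machines, which is what lets the model scale to large volumes of data.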

Data Access Components: Pig and Hive

Data Storage Component: HBase

Data Integration Components: Apache Flume, Sqoop and Chukwa

Data Management and Monitoring Components: Ambari, Oozie and ZooKeeper

Data Serialization Components: Thrift and Avro

Data Intelligence Components: Apache Mahout and Drill