
What is Hadoop?
Hadoop is a Java-based open-source framework, which uses a distributed storage and distributed processing approach to break-down large data sets. Hadoop distributes the large blocks of data to nodes in a computer cluster, which then processes data in parallel, using Jar files sent to them byMapReduce, the programming/processing model used in Hadoop.