The Apache Software Foundation overnight launched the general availability version of Hadoop 2, its open source framework for scalable distributed computing.
Hadoop 2 brings several new features not available from Hadoop 1, including high availability, data snapshots and federation for the Hadoop Distributed File System (HDFS), as well as support for Microsoft Windows.
NFS or network file system version 3 support is also built into Hadoop 2, with 64-bit sizes and offsets for very large files, and performance improvements.
The MapReduce layer has been superceded by YARN cluster resource management, which sits on top of HDFS to run data processing applications and services. MapReduce no longer handles the resource management and is becoming a user-land library instead.
With the help of YARN, applications can run inside Hadoop, providing multiple ways to process enterprise data.