Hadoop 2.0 focuses on scale and innovation, with Yarn (next-generation MapReduce) and federation capabilities. Yarn will let users add their own compute models so that they do not have to stick to MapReduce. "We're really looking forward to the community inventing many new ways of using Hadoop," Baldeschwieler says. Expected uses include real-time applications and machine-learning algorithms. Scalable, pluggable storage is planned also.
Always-on capabilities in Version 2.0 will enable clusters with no downtime. Scalable storage is planned as well. General availability of Hadoop 2.0 is expected within a year.
This story, "Hadoop becomes critical cog in the big data machine," was originally published at InfoWorld.com. Follow the latest developments in business intelligence at InfoWorld.com. For the latest developments in business technology news, follow InfoWorld.com on Twitter.
Read more about business intelligence in InfoWorld's Business Intelligence Channel.