What are the emerging alternatives to Hadoop?


And what chance do they have?

Tags: Hadoop
Topic: Big Data


7 total
Vote Up (33)

Hi Declanchase2,

Here are a few articles with ideas about alternatives to Hadoop.

Hadoop Fatigue -- Alternatives to Hadoop

What are some promising open-source alternatives to Hadoop MapReduce for map/reduce?

Alternatives for Hadoop/MapReduce data storage and management

Vote Up (30)

Hi Declanchase2,

There are many Hadoop alternatives out there, and the HPCC Systems platform is one of them for tackling Big Data problems. Unlike Hadoop distributions, which have only been available since 2009, HPCC is a mature platform. It provides a data delivery engine together with a data transformation and linking system equivalent to Hadoop. Its main advantages over other alternatives are real-time delivery of data queries and the extremely powerful programming model of the ECL language.


More at http://hpccsystems.com


Robert Metzger
Vote Up (15)


There are several alternative systems that can solve the same problems as Hadoop, and more.

https://stratosphere.eu/ is a project that started out as a research project at a university. It has a novel model that allows for more operators than just map and reduce (it also natively supports match, cross and more). It additionally allows arbitrarily complex job graphs, so you can combine these operators in any way you like. For example, you could have three inputs that are joined, reduced, mapped and then reduced again by another key. You can even write to as many outputs as you want.
Stratosphere also supports iterative algorithms (often needed for data mining and machine learning). Since iteration is implemented natively in the system, Stratosphere does much better on those jobs than traditional Hadoop systems.
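
To make the job graph described above more concrete, here is a minimal sketch in plain Python. It is not Stratosphere's API: the helper names (`join`, `reduce_by_key`, `run_job`) and the sample data layout are assumptions made up for illustration, and everything runs in memory on one machine. It only shows the shape of the graph: three inputs that are joined, reduced, mapped, reduced again by another key, and written to two outputs.

```python
def join(left, right):
    """Equi-join two lists of (key, value) pairs on their keys."""
    index = {}
    for key, value in right:
        index.setdefault(key, []).append(value)
    return [(key, (lv, rv)) for key, lv in left for rv in index.get(key, [])]


def reduce_by_key(records, combine):
    """Fold all values that share a key into one value per key."""
    groups = {}
    for key, value in records:
        groups[key] = combine(groups[key], value) if key in groups else value
    return sorted(groups.items())


def run_job(users, orders, prices):
    """Hypothetical job graph: 3 inputs -> join -> reduce -> map -> reduce -> 2 outputs.

    users:  (user_id, country)
    orders: (product_id, user_id)
    prices: (product_id, price)
    """
    # Join input 1 and input 2 on product_id, then re-key by user_id.
    priced = [(user, price) for _, (user, price) in join(orders, prices)]
    # Join with input 3 on user_id.
    with_country = join(priced, users)
    # First reduce: total spend per (user_id, country).
    per_user = reduce_by_key(
        [((user, country), price) for user, (price, country) in with_country],
        lambda a, b: a + b,
    )
    # Map: re-key each total by country only.
    by_country = [(country, total) for (_, country), total in per_user]
    # Second reduce, by another key: highest per-user spend in each country.
    per_country = reduce_by_key(by_country, max)
    # Two outputs from a single job.
    return {"spend_per_user": per_user, "top_spend_per_country": per_country}


if __name__ == "__main__":
    users = [(1, "DE"), (2, "DE"), (3, "US")]
    orders = [("a", 1), ("b", 1), ("a", 2), ("b", 3), ("b", 3)]
    prices = [("a", 10), ("b", 5)]
    for name, rows in run_job(users, orders, prices).items():
        print(name, rows)
```

In a plain MapReduce setting, a pipeline like this would typically be split into several chained jobs. The point of the sketch is only that the whole graph can be expressed as one job.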

There is an actively developed open source version of it on GitHub: https://github.com/dimalabs/ozone

Another project is Spark: http://spark.incubator.apache.org/ 

It allows applications to be written in Scala, which is a very powerful and expressive functional programming language (Stratosphere also supports Scala). Job setup is really fast, so it is well suited to small and medium-sized data and ad-hoc evaluations.

Also take a look at what Cloudera and its competitors are doing (Impala, Hive Stinger Initiative).


Disclaimer: I'm a developer of Stratosphere ;)

Lenin Nair
Vote Up (7)

Indeed there are. In a recent post at MSys, we discussed alternatives to Hadoop. Check it out.

Lenin Nair
Vote Up (5)

Wrong link, here is the correct link

