What are the emerging alternatives to Hadoop?


And what chance do they have?

Tags: Hadoop
Topic: Big Data
Answer this Question


7 total
Vote Up (34)

Hi Declanchase2,

Here are a few articles with ideas about alternatives to Hadoop.

Hadoop Fatigue -- Alternatives to Hadoop

What are some promising open-source alternatives to Hadoop MapReduce for map/reduce?

Alternatives for Hadoop/MapReduce data storage and management

Vote Up (32)

Hi Declanchase2,

There are many Hadoop alternatives out there. The HPCC Systems platform is among them for tackling Big Data problems. Unlike Hadoop distributions which have only been available since 2009, HPCC is a mature platform, and provides for a data delivery engine together with a data transformation and linking system equivalent to Hadoop. The main advantages over other alternatives are the real-time delivery of data queries and the extremely powerful ECL language programming model.


More at http://hpccsystems.com

Vote Up (23)

welcome to our website:

------- http://www.likesurprise.com/ --------

if you like to order anything you like.

More details,

please just browse our website Quality is our Dignity;

Service is our Lift.

enjoy yourself.

thank you!!

Robert Metzger
Vote Up (16)


there are several alternative systems that can solve the same and more problems than Hadoop.

https://stratosphere.eu/ is a project that started out as a research project at a university. It has a novel model that allows for more operators than just map and reduce. (It also natively supports match, cross and more). It additionally allows for arbitrary complex job graphs. So you can compine these operators in any way you like. So you could have three inputs, that are joined, reduced, mapped and reduced (by another key). You can even write to as many outputs as you want.
Additionally, Stratosphere also supports iterative algorithms (often needed for Data Mining/Machine Learning). Since this is "natively" implemented into the system, Stratosphere does way better on those jobs than traditional hadoop systems.

There is an actively developed open source version of it on github: https://github.com/dimalabs/ozone 

Another project is Spark: http://spark.incubator.apache.org/ 

It allows applications to be written in Scala, which is an very powerful and expressive functional programming language (Stratosphere also supports Scala). It is really fast on job setup, hence it is very suited for small and medium sized data and ad-hoc evaluations.

Take also a look into the things Cloudera and its competitors are doing (Impala, Hive Stinger Initiative)


Disclaimer: I'm a developer of stratosphere ;)

Lenin Nair
Vote Up (8)
Lenin Nair
Vote Up (7)

Wrong link, here is the correct link

Lenin Nair
Vote Up (7)

Indeed there are. In a recent post at MSys, we had discussed about alternatives for Hadoop. Check it out.

Ask a question

Join Now or Sign In to ask a question.
A new analysis of Reddit comments shows which language’s developers seem to be the happiest - and which are the most foul-mouthed
In the wake of recent security breaches of medical databases, doctors can’t be too careful
Analytics 3.0 will go beyond internal use and become a driver of external products and services.
Many executives and organizations see big data as a panacea, but data and analytics can't address every problem you face.
A new study reveals that Java developers make the most while JavaScript programmers are the most wanted
Adatao is another startup promising easier data analytics for the masses. It stands out in a few ways.
New data from AngelList shows the top technology choices that startups are making
Aiming to expand its operational intelligence capabilities, Splunk today unveiled Splunk App for Stream, which the company says is a free addition to Splunk Enterprise and Splunk Cloud that makes it easy to capture wire data and combine it with the machine-generated data Splunk already captures and analyzes.
NomadList uses crowdsourced data to show which cities in the U.S. and the world are the best - and worst - for remote workers
Viewing the data center as the focal point of an ambitious set of technology initiatives, federal CIOs are working aggressively to slash server counts and consolidate facilities as they position their agencies to adopt cloud applications, roll out mobile technologies and support big data projects.