What are the emerging alternatives to Hadoop?


And what chance do they have?

Tags: Hadoop
Topic: Big Data
Answer this Question


7 total
Vote Up (30)

Hi Declanchase2,

Here are a few articles with ideas about alternatives to Hadoop.

Hadoop Fatigue -- Alternatives to Hadoop

What are some promising open-source alternatives to Hadoop MapReduce for map/reduce?

Alternatives for Hadoop/MapReduce data storage and management

Vote Up (28)

Hi Declanchase2,

There are many Hadoop alternatives out there. The HPCC Systems platform is among them for tackling Big Data problems. Unlike Hadoop distributions which have only been available since 2009, HPCC is a mature platform, and provides for a data delivery engine together with a data transformation and linking system equivalent to Hadoop. The main advantages over other alternatives are the real-time delivery of data queries and the extremely powerful ECL language programming model.


More at http://hpccsystems.com

Vote Up (17)

welcome to our website:

------- http://www.likesurprise.com/ --------

if you like to order anything you like.

More details,

please just browse our website Quality is our Dignity;

Service is our Lift.

enjoy yourself.

thank you!!

Robert Metzger
Vote Up (10)


there are several alternative systems that can solve the same and more problems than Hadoop.

https://stratosphere.eu/ is a project that started out as a research project at a university. It has a novel model that allows for more operators than just map and reduce. (It also natively supports match, cross and more). It additionally allows for arbitrary complex job graphs. So you can compine these operators in any way you like. So you could have three inputs, that are joined, reduced, mapped and reduced (by another key). You can even write to as many outputs as you want.
Additionally, Stratosphere also supports iterative algorithms (often needed for Data Mining/Machine Learning). Since this is "natively" implemented into the system, Stratosphere does way better on those jobs than traditional hadoop systems.

There is an actively developed open source version of it on github: https://github.com/dimalabs/ozone 

Another project is Spark: http://spark.incubator.apache.org/ 

It allows applications to be written in Scala, which is an very powerful and expressive functional programming language (Stratosphere also supports Scala). It is really fast on job setup, hence it is very suited for small and medium sized data and ad-hoc evaluations.

Take also a look into the things Cloudera and its competitors are doing (Impala, Hive Stinger Initiative)


Disclaimer: I'm a developer of stratosphere ;)

Lenin Nair
Vote Up (3)

Indeed there are. In a recent post at MSys, we had discussed about alternatives for Hadoop. Check it out.

Lenin Nair
Vote Up (2)

Wrong link, here is the correct link

Lenin Nair
Vote Up (1)

Ask a question

Join Now or Sign In to ask a question.
Big data analytics are driving rapid growth for public cloud computing vendors with revenues for the top 50 public cloud providers shooting up 47% in the fourth quarter last year to $6.2 billion, according to Technology Business Research Inc.
According to a new dataset, the big names in technology lag well behind actors, politicians and athletes in terms of global cultural significance
Every business, it seems, needs a data scientist, but not everyone knows what to look for. The four qualities of a good data scientist described here will help you first write a job description and then evaluate candidates for your data scientist vacancy.
Big data analytics are driving rapid growth for public cloud computing vendors with revenues for the top 50 public cloud providers shooting up 47% in the fourth quarter last year to $6.2 billion, according to Technology Business Review Inc.
The Big Data space is heating up – to the point that many pundits already see it as the over-hyped heir to "cloud." The hype may be a bit much, but Big Data is already living up to its potential, transforming entire business lines, such as marketing, pharmaceutical research, and cyber-security.
Without the computing power to assess all the data coming from connected devices, GE suggests that enterprises won't realize the full potential of the industrial Internet.
With Teradata QueryGrid, your data warehouse can now intelligently use the functionality of multiple, heterogeneous processing engines, including Hadoop.
With the release of the Hortonworks Data Platform 2.1 version of its Hadoop distribution, Hortonworks is packing in new enterprise features, including data access, data governance, data management, security and operations.
Pivotal unveils the Pivotal Big Data Suite, an all-you-can-eat software, support and maintenance platform that's designed to provide access to all the technologies required to build a business data lake with a single pricing metric.
A new study of the questions asked on Stack Exchange reveals what issues are giving web developers headaches

White Papers & Webcasts

See more White Papers | Webcasts

Join us: