What are the emerging alternatives to Hadoop?

declanchase2

And what chance do they have?

Tags: Hadoop
Topic: Big Data
Answer this Question

Answers

7 total
jimlynch
Vote Up (34)

Hi Declanchase2,

Here are a few articles with ideas about alternatives to Hadoop.

Hadoop Fatigue -- Alternatives to Hadoop
http://www.bytemining.com/2011/08/hadoop-fatigue-alternatives-to-hadoop/

What are some promising open-source alternatives to Hadoop MapReduce for map/reduce?
http://www.quora.com/What-are-some-promising-open-source-alternatives-to...

Alternatives for Hadoop/MapReduce data storage and management
http://www.dbms2.com/2011/05/14/hadoop-mapreduce-data-storage-management/

Vote Up (31)

Hi Declanchase2,


There are many Hadoop alternatives out there. The HPCC Systems platform is among them for tackling Big Data problems. Unlike Hadoop distributions which have only been available since 2009, HPCC is a mature platform, and provides for a data delivery engine together with a data transformation and linking system equivalent to Hadoop. The main advantages over other alternatives are the real-time delivery of data queries and the extremely powerful ECL language programming model.

 

More at http://hpccsystems.com

jiexiao
Vote Up (23)

welcome to our website:

------- http://www.likesurprise.com/ --------

if you like to order anything you like.

More details,

please just browse our website Quality is our Dignity;

Service is our Lift.

enjoy yourself.

thank you!!

Robert Metzger
Vote Up (16)

Hi,

there are several alternative systems that can solve the same and more problems than Hadoop.

https://stratosphere.eu/ is a project that started out as a research project at a university. It has a novel model that allows for more operators than just map and reduce. (It also natively supports match, cross and more). It additionally allows for arbitrary complex job graphs. So you can compine these operators in any way you like. So you could have three inputs, that are joined, reduced, mapped and reduced (by another key). You can even write to as many outputs as you want.
Additionally, Stratosphere also supports iterative algorithms (often needed for Data Mining/Machine Learning). Since this is "natively" implemented into the system, Stratosphere does way better on those jobs than traditional hadoop systems.

There is an actively developed open source version of it on github: https://github.com/dimalabs/ozone 

Another project is Spark: http://spark.incubator.apache.org/ 

It allows applications to be written in Scala, which is an very powerful and expressive functional programming language (Stratosphere also supports Scala). It is really fast on job setup, hence it is very suited for small and medium sized data and ad-hoc evaluations.

Take also a look into the things Cloudera and its competitors are doing (Impala, Hive Stinger Initiative)

 

Disclaimer: I'm a developer of stratosphere ;)

Lenin Nair
Vote Up (7)
Lenin Nair
Vote Up (7)

Indeed there are. In a recent post at MSys, we had discussed about alternatives for Hadoop. Check it out.

Lenin Nair
Vote Up (6)

Wrong link, here is the correct link

Ask a question

Join Now or Sign In to ask a question.
Analytics 3.0 will go beyond internal use and become a driver of external products and services.
Many executives and organizations see big data as a panacea, but data and analytics can't address every problem you face.
A new study reveals that Java developers make the most while JavaScript programmers are the most wanted
Adatao is another startup promising easier data analytics for the masses. It stands out in a few ways.
New data from AngelList shows the top technology choices that startups are making
Aiming to expand its operational intelligence capabilities, Splunk today unveiled Splunk App for Stream, which the company says is a free addition to Splunk Enterprise and Splunk Cloud that makes it easy to capture wire data and combine it with the machine-generated data Splunk already captures and analyzes.
NomadList uses crowdsourced data to show which cities in the U.S. and the world are the best - and worst - for remote workers
Viewing the data center as the focal point of an ambitious set of technology initiatives, federal CIOs are working aggressively to slash server counts and consolidate facilities as they position their agencies to adopt cloud applications, roll out mobile technologies and support big data projects.
The one-two punch of consumers' eagerness to share their opinions and their unfamiliarity with business contacts spells opportunity.
With its new SLA, Splunk assures its Splunk Cloud customers that their machine data analytics will be available 100 percent of the time.
Join us:
Facebook

Twitter

Pinterest

Tumblr

LinkedIn

Google+

randomness