What could cause persistent client disconnects while trying to run Hadoop Zookeeper through a cloud provider?

rtrembley

We are trying to set up Zookeeper in a virtual environment through our cloud provider. On a regular basis we are getting client disconnects. Is there a common cause for these disconnects to keep happening?

Topic: Big Data
Answer this Question

Answers

2 total
jimlynch
Vote Up (14)

You might want to check the documentation or discuss Hadoop on their mailing list. There are probably some other folks who have seen similar issues.

Here's the link:

http://hadoop.apache.org/common/index.html

rousseau
Vote Up (10)

You didn't mention your timeout config, but it should be set pretty high for your situation.  There can be a lot of latency issues that potentially cause problems in a virtual environment.  Try configuring timeout at something high like 10 seconds and see if you notice an improvement.      

Ask a question

Join Now or Sign In to ask a question.
Big data analytics are driving rapid growth for public cloud computing vendors with revenues for the top 50 public cloud providers shooting up 47% in the fourth quarter last year to $6.2 billion, according to Technology Business Research Inc.
According to a new dataset, the big names in technology lag well behind actors, politicians and athletes in terms of global cultural significance
Every business, it seems, needs a data scientist, but not everyone knows what to look for. The four qualities of a good data scientist described here will help you first write a job description and then evaluate candidates for your data scientist vacancy.
Big data analytics are driving rapid growth for public cloud computing vendors with revenues for the top 50 public cloud providers shooting up 47% in the fourth quarter last year to $6.2 billion, according to Technology Business Review Inc.
The Big Data space is heating up – to the point that many pundits already see it as the over-hyped heir to "cloud." The hype may be a bit much, but Big Data is already living up to its potential, transforming entire business lines, such as marketing, pharmaceutical research, and cyber-security.
Without the computing power to assess all the data coming from connected devices, GE suggests that enterprises won't realize the full potential of the industrial Internet.
With Teradata QueryGrid, your data warehouse can now intelligently use the functionality of multiple, heterogeneous processing engines, including Hadoop.
With the release of the Hortonworks Data Platform 2.1 version of its Hadoop distribution, Hortonworks is packing in new enterprise features, including data access, data governance, data management, security and operations.
Pivotal unveils the Pivotal Big Data Suite, an all-you-can-eat software, support and maintenance platform that's designed to provide access to all the technologies required to build a business data lake with a single pricing metric.
A new study of the questions asked on Stack Exchange reveals what issues are giving web developers headaches

White Papers & Webcasts

See more White Papers | Webcasts

Join us:
Facebook

Twitter

Pinterest

Tumblr

LinkedIn

Google+