Disruptive change in the database world: Get ready

Databases are on the verge of an epochal shift

By Billy Bosworth, VP and general manager at Quest Software, Network World |  Data Center/Servers, database, Hadoop Add a new comment

Major market shifts in the database world don't happen often.  When they do, they're massive, creating an impact that can last 10 to 20 years. When I entered the job market, it was right at the tail end of the last major shift from the mainframe to client/server.  Our CIO announced his (aggressive) vision of the client/server future and I noticed three primary responses from my more seasoned colleagues.  They either ridiculed it and ignored it; did a little nail biting, wondering what it all means; or got on board as fast as they could.

The new databases of today have the potential to be every bit as disruptive as the client/server wave, and the human reactions I saw nearly 20 years ago haven't changed all that much.  These disruptors bring change, but while it sounds cliche, the truth is that with this change comes tremendous opportunity. 

Let's try and classify some of these changes into a few big buckets so we can get a feel for how they could affect the data center as well as individuals from a career perspective. 

NoSQL Databases

For a whole lot of reasons that are beyond the scope of this article, modern applications are being asked to scale and perform at radically higher rates than ever before.  When someone builds these modern applications, did you ever wonder what manages the data behind them?

Here's a hint: It's often not a relational database, and the reason is something called the CAP theorem. 

The CAP theorem -- first articulated by Eric Brewer in 2000 -- essentially explains what the Rolling Stones knew long ago: "You can't always get what you want."  CAP is an acronym for consistency, availability and partition tolerance (scale out).  To drastically oversimplify, it says: "Pick any two, but you can't have all three." 

Relational databases are fantastic at consistency and availability.  But partition tolerance?  Not so much.  That's a problem, because what's often needed most in these modern applications is the ability to scale out across many database servers, even at the expense of something else. That something is transactional consistency.  So what if your status doesn't immediately appear on all your friends' walls? Not a huge deal.  But if that application is running slow because it can't scale -- that's a show stopper.  This is where the NoSQL database comes into play.

There is a lot of debate over what to call these things, but I think NoSQL conveys the point that this is not a relational database, nor does it have things like the transactional consistency we take for granted in a relational model. (There's no reason that, in theory, they cannot use SQL, but the name caught on, so we're going to run with it.)  What they can do is scale out like nobody's business without the complexities of data modeling and schema management, which most developers just love.

MapReduce and Hadoop

NoSQL databases strive to meet the scale demands for large-scale Web applications.  However, they don't do much to address the business need for analyzing the resulting masses of data.  One solution is to go back to the future and create a brute-force way of scaling and processing.  Google's answer was MapReduce, which most of the world now knows in its open source form, Hadoop. 

Apache Hadoop can churn through petabytes of data across thousands of nodes running nothing more than commodity hardware. The reduction in storage costs alone can be astounding, but there are definitely tradeoffs.  Setting up Hadoop is not trivial, and your army of resources who know SQL will have to learn about things like HIVE and PIG (I know... it is hard to write that with a straight face) to write MapReduce jobs to handle your business needs.  While many identify these problems as a step backwards, it does appear that Hadoop is here to stay. 

Specialized Databases

These aren't new, but they are getting a lot of attention, and for good reason. Relational databases have become like the Borg, assimilating various technologies along the way to become all things to all applications.  They're now big and complicated.  Specialized database advocates argue that, rather than trying to solve every problem with one solution, it is better to build the right specific technology to solve a specific problem. Doing so, they say, dramatically decreases cost and increases performance.


Originally published on Network World |  Click here to read the original story.

ITworld LIVE

Data Center/ServersWhite Papers & Webcasts

White Paper

The Forrester Wave™: Disaster Recovery Services Providers

Improvements in disaster recovery plans and broad business continuity strategies are top-of-mind concerns for leading enterprises today and recovery time is now measured in hours and minutes not days. These key insights are discussed in the 2010 Forrester Wave Report.

White Paper

Roadmap to the Cloud Summary HP Brochure

This white paper reveals the key steps you need to take in order to build an effective cloud computing infrastructure. Start building your cloud step-by-step today.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Forrester Whitepaper: IT Operations Managers Must Rethink Their Approach to Private Cloud

Organizations of all types are attracted by the promises of private cloud computing, but few actually have the virtual maturity to be successful. This Forrester report reveals the latest virtualization trends so you can see how far your peers are in their journey to the private cloud. Read on and discover best practices for improving virtualization in order to prepare for the cloud.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Building Cloud-Optimized Data Center Networks white paper

Enterprises are turning to the Cloud to improve business agility, reduce expenses and accelerate business innovation. Cloud computing redefines the way IT assets are deployed and consumed and dramatically affects the way data center networks are architected and managed. Conventional hierarchical data center networks built to support traditional IT architectures can't meet the security, agility and price/performance requirements of virtualized cloud computing environments. This white paper reviews the impact of cloud computing on data center networks and describes HP's approach to building simpler, more secure and automated networks that fully meet the stringent performance, security, reliability and agility demands of the new data center in the Cloud.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Seven Priorities for Integrated Network Management - How HP Intelligent Management Center Delivers an Enterprise-class Solution

This white paper describes the major requirements for network management solutions to help the organizations become more profitable, efficient and reliable.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

See more White Papers | Webcasts

Ask a question

Ask a Question