Data Deduplication and Backup

By Dan Blacharski  Add a new comment

In the world of backup and disaster recovery, we’re constantly looking for performance advantages and ways to save space, especially as the storage requirements grow exponentially over time.

Data deduplication sounds like a complicated subject, but it’s really just what it sounds like—removing redundant data in storage. Really a simple concept. It provides a great way to keep storage costs under control, gain faster access, and ultimately, boost productivity. Deduplication is especially relevant in a virtual server environment which may have virtual machines scattered in hundreds of different locations, with each virtual machine containing a large amount of duplicate data. And here’s the big issue: Storage may be cheap, but companies grow quickly and their data requirements grow even faster. We need key technology to keep the storage requirements down. Data deduplication simply lets you use your storage and backup media for a longer period of time, since you can fit more data on it than would otherwise be possible.

Here’s how it works: The deduplication technology looks for redundant data, and when it encounters a piece of redundant data, it places a marker that points to the duplicate data block instead of actually storing multiple instances of the same data block. How much space can you save? It’s significant. Some studies have shown that you can get a ratio of 20:1. That’s a big savings in terms of not having to buy more storage media, and also in terms of administrative overhead.

Another advantage is in the area of disaster recovery. As part of your disaster recovery plan, if you replicate all of your data to an off-site data center, this too delivers advantage in terms of lowered requirements for off-site storage space. Deduplication can actually be applied in several different areas; in local backup as well as in remote backup. If you have multiple remote sites across a WAN, deduplication starts to really save tons of space, and a great side effect of it is that it can also improve throughput in sending stored data, since there’s less of it to send.

ITworld LIVE

StorageWhite Papers & Webcasts

White Paper

Using BD for Smarter Decision Making

This paper looks at new developments in business analytics and discusses the benefits analyzing big data bring to the business.

White Paper

Protecting Against Database Attacks and Insider Threats: Top 5 Scenarios

Read this new eBook to learn the top five scenarios and essential best practices for preventing database attacks and insider threats.

White Paper

The Best Way to Build a Cloud -- HP CloudSystem Matrix and HP 3PAR Utility Storage provide solid, flexible foundation

Learn how HP CloudSystem Matrix and HP 3PAR Utility Storage provide a solid, flexible foundation for your cloud environment.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Defining Tier One Storage in the Modern Data Center

This report defines "tier-1" storage in the modern IT world and in the data centers and services that support it. What was a simple environment just a few years ago with mainframes or a few large servers to be supported has evolved into a complex web of virtual machines, clouds, and expanding user expectations -- factors which demand and create flexibility, but do so in a way that pushes a lack of predictability upon the storage infrastructure. Learn what your criteria should be for tier-1 storage.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Converged Storage: Utility Storage - The Ideal Platform for Virtual and Cloud Computing

Server virtualization has transformed corporate IT -- companies have enjoyed major cost savings and have gained flexibility and efficiency. But this has also led to a proliferation of virtual machines and servers that threaten to overwhelm data movement and storage technologies. In this IDG Tech Dossier, learn how utility storage makes for massive consolidation, flexibility and scalability, so IT departments can reduce storage infrastructure and lower costs while improving their ability to respond to fast-changing needs of business units.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

See more White Papers | Webcasts

Ask a question

Ask a Question