Deduplication: A Quick Fix or The Way Forward for Storage?

By David Asher, Director of Product Management, Iron Mountain Digital, Iron Mountain Digital |  Storage, Data Backup, data deduplication Add a new comment

Having engulfed the IT team, the data avalanche is set to hit the wider business. Organizations now have to impose restrictions on email inboxes and local storage facilities because of the amount of data being produced by users across the network.

The problem may feel insurmountable; however a key way of reducing data is to examine how much information is simply replicated by multiple users. With standardized operating systems and applications, come thousands of identical files on legions of computers. Add to that identical attachments stored in multiple recipients’ inboxes and it’s easy to see how much duplicate documents add to an organization's storage requirements.

Unless organizations want to risk facing an all-engulfing data avalanche, the amount of data being stored has to be reduced, or managed more efficiently. Vendors have been quick to address this critical pain point; however it’s unclear as to whether these technologies have the capacity to cope with new developments in data such as bigger file sizes, multimedia formats and distributed data. Unless the situation is evaluated now, companies may find themselves left with a quick-fix solution that could quickly leave them in the same position as before.

Deduplication – A Lasting Solution?
Deduplication has quickly risen to the top of the IT agenda as a method to help reduce storage and power costs through streamlining the amount of information needing to be backed up. It also helps to address issues such as business continuity, e-discovery and compliance requests.

Deduplication technologies can take a myriad of forms, but there are several fundamental methodologies:

• Elimination of identical duplicate files across the network
• Incremental backups – finding the differences between today’s and yesterday’s files and only saving the changes
• File compression – further reducing the volume of data stored

These techniques are highly effective at stripping out a huge amount of backed up data that simply isn’t required. The technology can also work across distributed data centers, ensuring one centralized version of a document is backed up, rather than several different versions held on different devices.

However, deduplication really only tackles the initial symptoms of the data mountain and will not be able to match the growing average size of files as video and media files become increasingly popular. Compression is already implemented within these file formats, which will mean a reduction rate at the transmission stage. Effectively storage could get worse, rather than better.

Data Reduction – The Next Generation
Data reduction takes deduplication one step further, moving it from a reactive to a proactive approach to data management. The technique automates data movement and deletion from the desktop, which reduces the physical volume of data moving around the organization.

Policy driven, the technique ‘tags’ files that are deemed no longer required – this is established through a rules-based system that can be set up by administrators or IT managers. These files can then be extracted from their current position and either moved to the archive or deleted securely.

Data reduction should technically reduce the requirement to educate users about how to manage their own data storage effectively. Moving data management to an automated, policy-driven mechanism removes the need for workers to worry about where and when their data is backed up.

However, ensuring users understand why data reduction policies are in place and how they can help remove any blockages to the backup pipeline will always help an organization's long-term data strategies succeed. Common practices, such as using email inboxes as a secondary storage system for large documents such as PowerPoint presentations, will always continue. At the same time, IT managers should still encourage users to take a robust and rigorous approach to their individual storage habits.

One thing in storage remains constant – the amount of data we produce on a daily basis will continue to grow. IT managers who do not bury their heads in the sand and hope for the best are on the right track – data reduction policies need to be conceived and executed now to ensure employees aren’t brought to a halt by a data avalanche.

ITworld LIVE

StorageWhite Papers & Webcasts

White Paper

Using BD for Smarter Decision Making

This paper looks at new developments in business analytics and discusses the benefits analyzing big data bring to the business.

White Paper

Protecting Against Database Attacks and Insider Threats: Top 5 Scenarios

Read this new eBook to learn the top five scenarios and essential best practices for preventing database attacks and insider threats.

White Paper

The Best Way to Build a Cloud -- HP CloudSystem Matrix and HP 3PAR Utility Storage provide solid, flexible foundation

Learn how HP CloudSystem Matrix and HP 3PAR Utility Storage provide a solid, flexible foundation for your cloud environment.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Defining Tier One Storage in the Modern Data Center

This report defines "tier-1" storage in the modern IT world and in the data centers and services that support it. What was a simple environment just a few years ago with mainframes or a few large servers to be supported has evolved into a complex web of virtual machines, clouds, and expanding user expectations -- factors which demand and create flexibility, but do so in a way that pushes a lack of predictability upon the storage infrastructure. Learn what your criteria should be for tier-1 storage.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Converged Storage: Utility Storage - The Ideal Platform for Virtual and Cloud Computing

Server virtualization has transformed corporate IT -- companies have enjoyed major cost savings and have gained flexibility and efficiency. But this has also led to a proliferation of virtual machines and servers that threaten to overwhelm data movement and storage technologies. In this IDG Tech Dossier, learn how utility storage makes for massive consolidation, flexibility and scalability, so IT departments can reduce storage infrastructure and lower costs while improving their ability to respond to fast-changing needs of business units.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

See more White Papers | Webcasts

Ask a question

Ask a Question