Deduplication: A Quick Fix or The Way Forward for Storage?

By David Asher, Director of Product Management, Iron Mountain Digital, Iron Mountain Digital |  Storage, Data Backup, data deduplication Add a new comment

Having engulfed the IT team, the data avalanche is set to hit the wider business. Organizations now have to impose restrictions on email inboxes and local storage facilities because of the amount of data being produced by users across the network.

The problem may feel insurmountable; however a key way of reducing data is to examine how much information is simply replicated by multiple users. With standardized operating systems and applications, come thousands of identical files on legions of computers. Add to that identical attachments stored in multiple recipients’ inboxes and it’s easy to see how much duplicate documents add to an organization's storage requirements.

Unless organizations want to risk facing an all-engulfing data avalanche, the amount of data being stored has to be reduced, or managed more efficiently. Vendors have been quick to address this critical pain point; however it’s unclear as to whether these technologies have the capacity to cope with new developments in data such as bigger file sizes, multimedia formats and distributed data. Unless the situation is evaluated now, companies may find themselves left with a quick-fix solution that could quickly leave them in the same position as before.

Deduplication – A Lasting Solution?
Deduplication has quickly risen to the top of the IT agenda as a method to help reduce storage and power costs through streamlining the amount of information needing to be backed up. It also helps to address issues such as business continuity, e-discovery and compliance requests.

Deduplication technologies can take a myriad of forms, but there are several fundamental methodologies:

• Elimination of identical duplicate files across the network
• Incremental backups – finding the differences between today’s and yesterday’s files and only saving the changes
• File compression – further reducing the volume of data stored

These techniques are highly effective at stripping out a huge amount of backed up data that simply isn’t required. The technology can also work across distributed data centers, ensuring one centralized version of a document is backed up, rather than several different versions held on different devices.

However, deduplication really only tackles the initial symptoms of the data mountain and will not be able to match the growing average size of files as video and media files become increasingly popular. Compression is already implemented within these file formats, which will mean a reduction rate at the transmission stage. Effectively storage could get worse, rather than better.

Data Reduction – The Next Generation
Data reduction takes deduplication one step further, moving it from a reactive to a proactive approach to data management. The technique automates data movement and deletion from the desktop, which reduces the physical volume of data moving around the organization.

Policy driven, the technique ‘tags’ files that are deemed no longer required – this is established through a rules-based system that can be set up by administrators or IT managers. These files can then be extracted from their current position and either moved to the archive or deleted securely.

Data reduction should technically reduce the requirement to educate users about how to manage their own data storage effectively. Moving data management to an automated, policy-driven mechanism removes the need for workers to worry about where and when their data is backed up.

However, ensuring users understand why data reduction policies are in place and how they can help remove any blockages to the backup pipeline will always help an organization's long-term data strategies succeed. Common practices, such as using email inboxes as a secondary storage system for large documents such as PowerPoint presentations, will always continue. At the same time, IT managers should still encourage users to take a robust and rigorous approach to their individual storage habits.

One thing in storage remains constant – the amount of data we produce on a daily basis will continue to grow. IT managers who do not bury their heads in the sand and hope for the best are on the right track – data reduction policies need to be conceived and executed now to ensure employees aren’t brought to a halt by a data avalanche.

    Add a comment

    Post a comment using one of these accounts
    Or join now
    At least 6 characters

    Note: Comment will appear soon after you have activated your account.
    Obscene/spam comments will be removed and accounts suspended.
    The information you submit is subject to our Privacy Policy and Terms of Service.

    ITworld LIVE

    StorageWhite Papers & Webcasts

    White Paper

    ESG ~ HP StoreOnce: the Next Wave of Data Deduplication

    Leveraging deduplication in backup environments yields significant advantages. The cost savings in reducing disk capacity requirements change the economics of disk-based backup. For some organizations, it allows disk-based backup-and, importantly, recovery-to be extended to additional workloads in the environment. For others, deduplication makes it possible to introduce disk-based backup where it may not have been feasible before.

    White Paper

    Evaluator Group: Storage Federation - IT Without Limits (Analysis of HP Peer Motion with Storage Federation)

    As the role of IT increases within organizations, the need to move data when and where it is needed is critical to support emerging business requirements. This has become increasingly difficult due to the huge growth of data volumes. This white paper sponsored by HP + Intel evaluates a solution that aims to enable the movement of data without physical limitations. Read now and see how this could enable agility and efficiency.

    White Paper

    HP Converged Storage Sets the Stage for the Next Era of Computing

    Enterprise storage has undergone many changes in recent years - with converged storage and infrastructure 2.0 paving the way for reduced IT infrastructure costs and greater performance. This report discusses the latest trends that are setting the stage for the next era of computing. Learn about the new infrastructure and storage trends that are changing the way business storage works today.

    White Paper

    AppAssure vs Acronis

    In this study of data protection for environments with virtual and physical servers running Windows, openBench Labs tested AppAssure Backup and Replication software v 4.7 and Acronis Backup & Recovery 11. Both solutions utilize block-based technology to unify data protection operations.

    White Paper

    Guaranteeing 100% Backup Recovery

    The single biggest challenge for IT personnel involved in the data protection process is making sure that their backups are recoverable every time. Management and users won't remember the ninety-nine successful recoveries but they will always remember the one failure.

    See more White Papers | Webcasts

    Ask a question

    Ask a Question