RainStor releases next-gen big data repository

New RainStor 4.5 structured data repository adds 'intra column' deduplication capabilities

By Lucas Mearian, Computerworld |  Software, big data, data management Add a new comment

RainStor today announced the next generation of its online data repository. The update adds data deduplication capabilities and improved optimization for storing computer-generated historical data.

[ See also: Big data: How a trucking firm drove out big errors ]

The new RainStor 4.5 version can run on a storage area network (SAN) or network-attached storage (NAS) system as a repository for structured data. The latest generation of the software is aimed at capturing and then serving up online transaction processing (OLTP) data sets, user log data and metadata.

The software comes with a resource description framework (RDF) interface to automatically join data from relational databases to the repository.

RainStor 4.5 adds "intra-column" deduplication, which is a single-instance storage feature that captures one copy of repetitive data and creates a pointer back to it for search queries. For example, online transaction databases may capture the same URL address over and over filling up millions of columns with repetitive data. RainStor will capture only a single copy of the URL and use it over and over when retrieving online transactions related to that particular online site.

"We're able to reduce the data footprint by 95% because of deduplication," said Ramon Chen, RainStor's vice president of product management.

The product's user interface replicates a standard relational database management system versus a data warehouse. Thus, administrators won't need additional training, Chen said.

Unlike an Oracle or a SQL database, which are optimized to find a single record among millions, RainStor's repository pre-analyzes data it stores. The product places millions of related records in large blocks that can be quickly retrieved by a computer system's memory for faster search results.

"It's like a global positioning system. In the search window, you can type in the city or type in the exact address. An Oracle database will immediately search for an exact address, which can take a long time. With RainStor, it first gets you to the city, then it narrows the search down to the exact address," Chen said.

Lucas Mearian covers storage, disaster recovery and business continuity, financial services infrastructure and health care IT for Computerworld. Follow Lucas on Twitter at @lucasmearian , or subscribe to Lucas's RSS feed . His e-mail address is lmearian@computerworld.com .

Read more about databases in Computerworld's Databases Topic Center.


Originally published on Computerworld |  Click here to read the original story.

ITworld LIVE

SoftwareWhite Papers & Webcasts

White Paper

Activities Streams Base An Integrated Social Layer

The enterprise social software market is exploding thanks to converging trends of consumerization, cloud, and mobile. In this must-read report, "The Forrester Wave: Activities Streams, Q2 2012", Forrester Research Inc. evaluated five social software vendors with core strengths in the stream based on the overall strength of vendors' current offerings, a clear product strategy, and vendor market presence. In a detailed look at the space, Forrester named Yammer as a leader.

White Paper

ESG Lab Review: HP 3PAR Peer Motion Software

This ESG Lab review sponsored by HP + Intel documents hands-on testing of HP 3PAR Peer Motion Software's distributed volume.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

ESG Lab Review: HP 3PAR Peer Motion Software

This ESG Lab review documents hands-on testing of HP 3PAR Peer Motion Software's distributed volume management with a focus on federated workload balancing, asset management, and thin provisioning.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Deliver Cost-Effective Business Continuity with Extreme Capacity

IBM DB2 provides application cluster transparency technology that equips organizations running OLTP applications with the ability to deliver high availability and continuous uptime for transactional data, plus the flexibility and capacity they need to remain competitive.

White Paper

What Developers Want: The End of Application Redeploys

Eliminate application restarts in Java with JRebel! JRebel is a JVM plugin that eliminates application redeploys from the Java development cycle, a process that takes over 10 minutes of coding time away from developers each working hour, according to a recent survey. Just code, refresh and see everything instantly.

See more White Papers | Webcasts

Ask a question

Ask a Question