topics that matter; ideas worth sharing

share a tip, submit a link, add something new

Companies need new ways to manage their data

April 17, 2001, 09:56 AM —  InfoWorld — 


Data is everywhere. Every day there is more and more of it. Managing this data alone is a challenge, but now companies face more and more types of data and more and more locations of that data.

To put the growth of data in perspective, a University of California at Berkeley study predicts that after taking approximately 300,000 years for humans to generate 12 exabytes (an exabyte is over 1 million terabytes or a million trillion bytes) of information, the next 12 exabytes will be accumulated in just two and a half years.

And the sources of data are growing as well. Witness the variety of corporate, personal, and industrial devices that not only house data but, more important, are becoming enabled to hook into back-end data sources to feed and retrieve data.

Meanwhile, only about 20 percent of the world's data resides in relational databases; the rest is in a combination of flat files, audio, video, prerelational, and unstructured formats -- not to mention the mountains of paper-based data just waiting to be digitized.

The result of incorporating all these different data types and sources is that data management is changing into a broader category of managing content that includes all data types, vendors say.

Data, data everywhere

To keep up with the data explosion, database vendors are working to manage more data types, and in some cases they are doing so from within the core database engine.


Beyond the horizon

All of the database vendors spend a great deal of energy and time touting the next version that will come to market, typically well before it will be generally available. Behind the scenes, however, at corporate headquarter campuses or in research labs, they are hard at work on features and functionality that won't appear in the immediately forthcoming revision.

In IBM's Silicon Valley labs, IBM Fellow Bruce Lindsay and colleagues are at work on the next generation of database replication technology, Lindsay says. Replication in this case refers to geographically dispersed databases being capable of receiving updates from each other in real time. The first generation of this technology is currently in DB2, but it is not as fast as it should be.

"You trade things like integrity for speed," Lindsay says. "What we're trying to accomplish is a combination of support in the engine and a fairly complex application that works with the database."

IBM is also working on a query optimizer that will learn the optimal route to certain data and be capable of automatically seeking out that path in the future, according to Bernie Schiefer, manager of DB2 performance.

Additionally, IBM is building what it calls a SMART (Self Managing and Resource Tuning) database, designed to reduce the human intervention needed to run and maintain it. "The long-term vision is to at least provide the option that the user not get involved," says Sam Lightstone, a senior technology development manager

I like it!
Post a comment
The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
Resources
White Paper

Symantec Backup Exec 12 and Backup Exec System Recovery 8 deliver industry leading Windows data protection and system recovery. Download this whitepaper to find out the top reasons to upgrade and how to get continuous data protection and complete system recovery.

Webcast

Data and system loss — from a hard drive failure, malicious attack, natural disaster, or simple human error — can happen anytime. Don’t leave your business vulnerable. Make sure you have a secure recovery strategy in place. Symantec's latest backup and system recovery technology can efficiently restore critical applications, individual emails and documents and even restore your entire system in minutes in the event of a loss.

White Paper

Businesses face a growing challenge to ensure that the IT environment is properly protected. Backup Exec 12 integrates with other applications in the Symantec family of products, to complement your current data protection strategy, keep your data securely backed up and make it recoverable when you need it most.

Free stuff
Featured Sponsor

AISO founders envisioned a Web hosting company that was environmentally friendly. While the company employed energy-efficient innovations like solar panels, its infrastructure produced unacceptable power and cooling requirements. Find out how AISO leveraged AMD technology to overcome their challenge in this case study white paper.

In this whitepaper, Scalar explores the opportunity to change the landscape with respect to mission critical databases built around Oracle. Leveraging technologies such as Linux, high-end commodity processing power and Oracle RAC technology to architect, design, build and maintain database infrastructure that delivers maximum availability, reliability and performance at a fraction of traditional cost.

On a typical day, weather.com, the Web site for The Weather Channel in Atlanta, serves up between 15 million and 20 million page views. But in September 2004, when back-to-back hurricanes ransacked Florida, the peak traffic on one day more than tripled: over 70 million page views by more than 7 million unique visitors. Read the full success story now.

More Resources