Many companies have begun to see real value in capturing, storing and analyzing large volumes of operational and historical data, he said. "But it can get very expensive when you start talking about petabytes of data on a Teradata [RDBMS] platform" he said. "Hadoop is much more inexpensive."
Hadoop allows companies to gather, store and refine a wide variety of information, which they can then move to another warehouse for analysis, he said. Many companies have also begun running data analytics applications directly on top of their Hadoop environments, he said.
"The big analytics vendors certainly do need to have a Hadoop distribution," he said. "I think there is an urgency to have a Hadoop strategy."
Jaikumar Vijayan covers data security and privacy issues, financial services security and e-voting for Computerworld. Follow Jaikumar on Twitter at @jaivijayan or subscribe to Jaikumar's RSS feed . His e-mail address is email@example.com .
Read more about bi and analytics in Computerworld's BI and Analytics Topic Center.