Big Data

Get the most from your organization's data. Advice on data analytics, backup, data deduplication, mobile data, security, and more.

NDN hackathon
microsoft alteryx

Big data makes a difference at Penn Medicine

Here’s how one healthcare organization is making use of the massive amount of information – measurable in petabytes – it now has at its disposal to save lives.

big data group

4 traits good data scientists share

Big data is creating a booming market for data scientists as companies struggle to not only store loads of data, but also analyze it and interpret it. To be a good data scientist, you'll want to master these four areas.

Big Data (4)

IBM zeroes in on unstructured data with Cleversafe buy

IBM will acquire object-based storage vendor Cleversafe in a move to bolster its cloud business unit with more flexibility and simplified management options in the hybrid cloud, it announced on Monday.

Big Data

Banks' use of big data to be scrutinized by EU regulators

Focusing on the "opportunities and challenges" associated with big data, a new investigation aims to determine whether new regulatory or supervisory measures are needed.

big data scorecard

Hortonworks unveils big data scorecard

The new Hortonworks Big Data Scorecard is designed to help organizations assess their capabilities and build a plan to jump start big data projects.

Strata Hadoop big data

BlueTalon brings Hadoop security down to the file system

Big data can mean big threats to security, but BlueTalon just launched what it calls the first-ever filtering and dynamic masking capabilities for use directly on the Hadoop Distributed File System (HDFS).

ge digital power plant screencap

Cloud-based 'digital twins' could make power plants more efficient

General Electric is introducing the Digital Power Plant, a real-time simulation of a physical power plant that lives in the cloud and allows for better management and less downtime.


Pentaho's analytics software to blend multiple streams of big data

One of the challenges of big data is blending information from multiple sources, and Pentaho has developed new software specifically to make that process easier.

Microsoft Linux

Microsoft launches its big data service running on Linux

Welcome to Satya Nadella's Microsoft. Gone are the days when Microsoft treated Linux like a cancer.

storage database

MapR adds in-Hadoop document database

MapR Technologies has added native JSON support to its MapR-DB NoSQL database, giving developers the capability to quickly deliver scalable applications that leverage continuous analytics on real-time data.

Hadoop elephant code

Cloudera unveils in-memory store, security layer for Hadoop

The Hadoop distribution specialist today announced a new open source project designed to enable real-time analytic applications in Hadoop as well as a new open source security layer for fine-grained unified access control enforcement....

Enterprise software

Anaconda's Python-based analytics hit the enterprise with new subscription plans

Continuum Analytics' free Anaconda software has long been known for its Python-powered analytics capabilities, but on Monday the company unveiled a new offering designed specifically for enterprises.

microsoft azure data lake analytics

Microsoft expands Azure Data Lake with new big data tools

Microsoft had its sights set squarely on big data when it introduced its Azure Data Lake earlier this year, and on Monday it broadened that effort with new tools designed to make big data processing and analytics simpler and more...

big data warehouse center storage

Get ready to meet Kudu, a new, open-source storage engine from Cloudera

An open-source storage engine called Kudu could soon be on the way from Cloudera, offering a new alternative for companies with big data stores to manage.

Hadoop eats data analytics

Hadoop is slowly eating conventional analytics

The components of the Hadoop ecosystem won't overthrow Teredata or IBM Netezza any time soon, but ultimately, the commodity solution almost always wins.

memsql spark streamliner real time data pipeline

MemSQL paves a smoother path to Spark for real-time analytics

Spark Streamliner is a tool that integrates MemSQL's in-memory database and Apache Spark's in-memory data-processing framework for streaming data from real-time sources.

IBM Watson Web San Francisco

This is how the future looks with IBM Watson and 'perfect data'

Watson's AI services will bring us unparalleled convenience, untold marketing opportunities, and zero privacy.

tamr catalog dark data

Bring your company's 'dark data' to light with this free new tool from Tamr

Tamr's enterprise metadata catalog -- released Thursday into public beta -- can help companies create an inventory of their data sources, including metadata about who owns the source.

Load More