Big Data

Big Data | News, how-tos, features, reviews, and videos

holiday lights neurons network stream

What is Apache Spark? The big data analytics platform explained

Fast, flexible, and developer-friendly, Apache Spark is the leading platform for large-scale SQL, batch processing, stream processing, and machine learning

What quantum computing could do for a marketer's data and information management

The digital era has been hard enough to come to grips with. But at its heart the term ‘digital’ is rooted in a very simple binary code of ones and zeros.

artificial intelligence / machine learning / network

Apache PredictionIO: Easier machine learning with Spark

An open source project now under Apache’s guidance uses a template system for easy training and deployment of Spark-powered machine learning models

General Electric

GE adds edge analytics, AI capabilities to Predix industrial IoT suite

To solidify its position at the center of the industrial IoT, GE Digital is going to the edge.

Internet of things smart city with icons

Data or metadata? For the IoT they’re both important

As the amount of machine-generated data scales, indexing it via metadata will become critical.

Oracle headquarters

Oracle leverages AI to push enterprise apps as users move to cloud

Ratcheting up the pace of enhancements to its software as a service (SaaS) and platform as a service (PaaS) offerings, Oracle is riding the wave of companies that are ditching on-premises enterprise apps and heading to the cloud.

anil chakravarthy Informatica

Informatica brings AI to GDPR compliance, data governance

Companies that are scrambling to comply with the European Union's General Data Protection Regulation (GDPR) have a new tool to consider: Informatica's Compliance Data Lake, unveiled this week at the Strata Data Conference in New York....

bossies 2017 machine learning

Bossie Awards 2017: The best machine learning tools

InfoWorld picks the best open source software for machine learning and deep learning

bossies 2017 database analytics

Bossie Awards 2017: The best databases and analytics tools

InfoWorld picks the best open source software for large-scale search, SQL, NoSQL, and streaming analytics

20160224 stock mwc sap booth sign 100647700 orig

SAP wants to embrace all your data stores with Data Hub

If data warehouses are for tidiness freaks -- information packaged into neat inferences, sorted and stacked, the rest discarded -- and data lakes are for hoarders -- tip everything in, you never know what might be useful -- then SAP's...

keys to access solutions world in palm of hand

ONNX makes machine learning models portable, shareable

Microsoft and Facebook's machine learning model format aims to let devs choose frameworks freely and share trained models without hassle

cloud data warehouse

Users review the top cloud data integration tools

IT Central Station members weigh in on Informatica Cloud Data Integration, Dell Boomi AtomSphere, IBM App Connect, and SnapLogic

privacy

How much is a good deal worth?

When advertising crosses the line into invasion of privacy, consumers need to ask hard questions about what personal data they’re giving away.

eyeing big data in the cloud

What is data mining? How analytics uncovers insights

Data mining is the automated process of sorting through huge data sets to identify trends and patterns and establish relationships

solar eclipse

When identity data eclipses digital identity

Digital identity needs to be redefined as verified identity data. Identity data, using the right tools, can be used to carry out online jobs on behalf of the real me. But the right technology, aka personal data stores, need to be in...

13 frameworks for mastering machine learning

13 frameworks for mastering machine learning

Venturing into machine learning? These open source tools do the heavy lifting for you

Cloud

Oracle's Hurd, AT&T's Donovan on their massive cloud migration deal

In this Q&A, AT&T Communications CEO John Donovan and Oracle CEO Mark Hurd talk about their deal to work together to migrate thousands of databases to the cloud

2 data center servers

IBM speeds deep learning by using multiple servers

IBM's Distributed Deep Learning spreads model training across any number of hardware nodes—as long as they’re IBM nodes

convergence collaborate ideas datastream connection fiberwire

All your streaming data are belong to Kafka

Apache Kafka continues its ascent as attention shifts from lumbering Hadoop and data lakes to real-time streams

storm clouds dark

Data is eating the software that is eating the world

The data-driven machine learning algorithms that power AI will not only upend programming, but lower the barriers to AI itself

Load More