Big Data

Big Data | News, how-tos, features, reviews, and videos

big data code binary tunnel
maze lost question direction wayward

Illustration of head made out of gears with 2 hands holding it with cloud background

Automated machine learning or AutoML explained

AutoML frameworks and services eliminate the need for skilled data scientists to build machine learning and deep learning models

big data elephant analytics risk predictions vulnerable

HPE plus MapR: Too much Hadoop, not enough cloud

MapR gives HPE superior big data analytics technology and expertise, but not what HPE needs most

Clash of fists in silhouette

Julia vs. Python: Which is best for data science?

Python has turned into a data science and machine learning mainstay, while Julia was built from the ground up to do the job

blank tag isolated on white 95754104

Supervised learning explained

Supervised learning turns labeled training data into a tuned predictive model

blockchain network machine learning neural network

What is TensorFlow? The machine learning library explained

TensorFlow is a Python-friendly open source library for numerical computation that makes machine learning faster and easier

Dial allowing selection by flags of the world.

Natural language processing explained

Deep learning has improved machine translation and other NLP tasks by leaps and bounds

data science certification graduate with mortar board

15 best data science bootcamps for boosting your career

Whether you’re a recent grad, seasoned IT pro or someone looking to make a career change, these bootcamps will set you on the right path for a career in data science.

abstract data

Deep learning explained

Deep neural networks can solve the most challenging problems, but require abundant computing power and massive amounts of data

A woman holds a tablet and selects a graduation cap symbol from a virtual interface.

Top 14 data engineer and data architect certifications

Data engineers and data architects are in high demand. Here are the certifications that will give your career an edge.

ethical ai artificial intelligence algorithms

Financial firms bank on A.I. as pilot projects head to production

While AI is a buzzword in financial services, companies must be sure of a business use case even before putting together an AI dev team.

Exploding binary numbers

Machine learning algorithms explained

Machine learning uses algorithms to turn a data set into a model. Which algorithm works best depends on the problem


Delta Lake gives Apache Spark data sets new powers

A new open source project from Databricks adds ACID transactions, versioning, and schema enforcement to Spark data sources that don't have them

container ship storage transport colorful containers diversity outsourcing

IBM preps Watson AI services to run on Kubernetes

IBM Watson services arrive in versions that can run on the public cloud or on privately hosted container infrastructure

big data messaging system / information architecture / mosaic infrastructure

Built for realtime: Big data messaging with Apache Kafka, Part 2

Learn how to use Apache Kafka's partitions, message offsets, and consumer groups to distribute load and scale your applications horizontally, handling up to millions of messages per day

big data messaging system / information architecture / mosaic infrastructure

Built for realtime: Big data messaging with Apache Kafka, Part 1

Apache Kafka scales horizontally and offers much higher throughput than some traditional messaging systems. Get started with installation, then build your first Kafka messaging system

bos 2018 main rev

Bossies 2018: The Best of Open Source Software Awards

InfoWorld recognizes the leading open source projects for software development, cloud computing, big data, and machine learning

bos 2018 data

The best open source software for data storage and analytics

InfoWorld’s 2018 Best of Open Source Software Award winners in databases and data analytics

data lake

What is a data lake? Flexible big data management explained

A data lake can be a much more flexible repository than a data warehouse. Or it can be a trash dump that grows and grows

Load More