Automated machine learning or AutoML explained

AutoML frameworks and services eliminate the need for skilled data scientists to build machine learning and deep learning models

HPE plus MapR: Too much Hadoop, not enough cloud

MapR gives HPE superior big data analytics technology and expertise, but not what HPE needs most

Julia vs. Python: Which is best for data science?

Python has turned into a data science and machine learning mainstay, while Julia was built from the ground up to do the job

Supervised learning explained

Supervised learning turns labeled training data into a tuned predictive model

What is TensorFlow? The machine learning library explained

TensorFlow is a Python-friendly open source library for numerical computation that makes machine learning faster and easier

Natural language processing explained

Deep learning has improved machine translation and other NLP tasks by leaps and bounds

15 best data science bootcamps for boosting your career

Whether you’re a recent grad, seasoned IT pro, or someone looking to make a career change, these bootcamps will set you on the right path for a career in data science.

Deep learning explained

Deep neural networks can solve the most challenging problems, but require abundant computing power and massive amounts of data

Top 14 data engineer and data architect certifications

Data engineers and data architects are in high demand. Here are the certifications that will give your career an edge.

Financial firms bank on A.I. as pilot projects head to production

While AI is a buzzword in financial services, companies must be sure of a business use case even before putting together an AI dev team.

Machine learning algorithms explained

Machine learning uses algorithms to turn a data set into a model. Which algorithm works best depends on the problem

Delta Lake gives Apache Spark data sets new powers

A new open source project from Databricks adds ACID transactions, versioning, and schema enforcement to Spark data sources that don't have them

IBM preps Watson AI services to run on Kubernetes

IBM Watson services arrive in versions that can run on the public cloud or on privately hosted container infrastructure

Built for realtime: Big data messaging with Apache Kafka, Part 2

Learn how to use Apache Kafka's partitions, message offsets, and consumer groups to distribute load and scale your applications horizontally, handling up to millions of messages per day

Built for realtime: Big data messaging with Apache Kafka, Part 1

Apache Kafka scales horizontally and offers much higher throughput than some traditional messaging systems. Get started with installation, then build your first Kafka messaging system

Bossies 2018: The Best of Open Source Software Awards

InfoWorld recognizes the leading open source projects for software development, cloud computing, big data, and machine learning

The best open source software for data storage and analytics

InfoWorld’s 2018 Best of Open Source Software Award winners in databases and data analytics

What is a data lake? Flexible big data management explained

A data lake can be a much more flexible repository than a data warehouse. Or it can be a trash dump that grows and grows

