Archive for the ‘Tools’ Category

exploring map compass

Exploring the Possibilities of Artificial Intelligence

In this interview, Paco Nathan discusses making life more livable, AI fears, and more.

marbles small files

Handling Small Files in MapR-FS

In this post, we will discuss how dealing with small files is different if you are using MapR-FS rather than the traditional HDFS installation.

JupterCon notebook python

Themes from JupyterCon 2017

This past August was the first JupyterCon—an O’Reilly-sponsored conference around the Jupyter ecosystem, held in NYC. In this post we look at the major themes from the conference, and some top talks from each theme.

Space Shuttle Problems: Long-term Planning Amid Changing Technology

How can you manage your implementation in a way that allows you to take maximum advantage of technology innovation as you go, rather than having to freeze your view of technology to today’s state and design something that will be outdated when it launches? You must start by deciding which pieces are necessary now, and which can wait.

Data Ingestion with Spark and Kafka

In this tutorial, we will walk you through some of the basics of using Kafka and Spark to ingest data.

Understanding AI Toolkits

Understanding AI Toolkits

As well as developing familiarity with AI techniques, practitioners must choose their technology platforms wisely.

Creating a Continuous Delivery Pipeline for a Maven Project

How To Create a Continuous Delivery Pipeline for a Maven Project

We use continuous delivery automation tools and techniques that have become available in the last few years. Here we’ll walk through creation of a Maven-based Java project here and demonstrate incorporating it into our pipeline.

Exploratory data analysis in Python

Exploratory Data Analysis in Python

We summarize the objectives and contents of our PyCon tutorial, and then provide instructions for following along so you can begin developing your own EDA skills.

kafka spark pipelines monitoring alerting

Managing Spark and Kafka Pipelines

In this post, we will cover some of the basics of monitoring and alerting as it relates to data pipelines in general, and Kafka and Spark in particular.