Author Archive

Is Your Data Holding You Back?

In this post, we will discuss what “real” gaps in data look like and how to find them in your organization.

Easily Spinning up Data Platforms

A quick overview of the motivation behind our instant and repeatable data platform tool.

Making Spark and Kafka Data Pipelines Manageable with Tuning

In this post, we’ll walk you through how to use tuning to make your Spark/Kafka pipelines more manageable.

Spark Summit: Ignition in the Enterprise

We are excited to announce that Edd Wilder-James will join Reynold Xin as co-chair of the Spark Summit 2017 program in San Francisco.

Four Data Capabilities for Telecommunications

This post looks at four business analysis capabilities that connect the dots between promising applications of data assets for telecommunications companies.

Introducing a Value-Centered View of Data Maturity

In this post, we introduce our new data maturity model and include a link to the assessment.

The Data Platform Puzzle

Building or rebuilding a data platform can be a daunting task, as most questions that need to be asked have open-ended answers. But that doesn’t mean you have to guess and use your gut.

Models: From the Lab to the Factory

Deploying a model without a rigorous process in place has consequences. We go over techniques for successful deployment and management.

Building Tech Communities

In this interview, Travis talks about how to balance enterprise and open source, as well as what it takes to build a community.