Strata Data Conference New York 2017
The Strata Data Conference is where cutting-edge science and new business fundamentals intersect—and merge. Several of us will be there in September, discussing platforms, strategy, and tools. Let us know if you’ll be attending and would like to chat.
Tuesday, September 26
What are the essential components of a data platform? This tutorial will explain how the various parts of the Hadoop, Spark, and big data ecosystems fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads. By tracing the flow of data from source to output, we’ll explore the options and considerations for components, including:
- Acquisition from internal and external data sources
- Ingestion: offline and real-time processing
- Storage
- Analytics: batch and interactive
- Providing data services: exposing data to applications
We’ll also give advice on:
- Tool selection
- The function of the major Hadoop components and other big data technologies such as Spark and Kafka
- Integration with legacy systems
In this tutorial, we will share our methods and observations from three years of effectively deploying data science in enterprise organizations. Attendees will learn how to build, run, and get the most value from data science teams, and how to work with and plan for the needs of the business.
Agenda:
- Data science in the enterprise
- Building a data-driven culture
- Organizational concerns for data science
- Data science techniques
- Methods for running a data science project
- Hiring and managing data scientists
- Tools and platforms
- Deploying data science: from the lab to the factory
- Data science maturity models
Thursday, September 28
Ask me anything: Running data science in the enterprise and architecting data platforms
- Managing data science in the enterprise
- Architecting a data platform
- Creating a modern enterprise data strategy
Even if you don’t have a specific question, join in to hear what others are asking.