Mauricio Vacas

With years of experience working with cloud computing and distributed data architectures, Mauricio is passionate about creating value with technology. He is an industry-recognized leader in technical architecture for cloud-hosted data solutions.

Mauricio is a Senior Data Engineer at Silicon Valley Data Science. He has experience working with distributed storage and processing systems such as Hadoop, Spark, Cassandra and related tools in the ecosystem; application and web services development in Spring Java and Python; and designed and built cloud technical architectures in AWS and NTT. Mauricio has deployed models into production built using Spark, R, Impala, Hive, and other tools and works to bridge the gap between model development and deployment. Prior to joining SVDS, Mauricio was a technical architecture manager working in Accenture’s R&D group and Big Data practice. He managed a team of data scientists and engineers to build a web-scale recommender and network analytics streaming services on cloud infrastructure and presented the work at Strata and DataStax NYC* conferences. He was also a main developer in Accenture’s Cloud Platform which is used in over 30 client solutions and over 1600 managed servers. He has experience working with clients in the retail, healthcare, and banking industries.

Mauricio holds a Masters of Science in Computer Engineering from the University of Florida.

Models: From the Lab to the Factory

Deploying a model without a rigorous process in place has consequences. We go over techniques for successful deployment and management.

March 15, 2017

How I Learned to Stop Worrying and Love Ephemeral Storage

This post will show architects and developers how to set up Hadoop to communicate with S3, use Hadoop commands directly against S3, use distcp to perform transfers between Hadoop and S3, and how distcp can be used to update on a regular basis based only on differences.

August 4, 2016

From Impala to Hive with Love

While on paper it should be a seamless transition to run Impala code in Hive, in reality it’s more like playing a relentless game of whack-a-mole. This post provides hints to make the transition easier.

October 20, 2015

Past Events

2025

2017

Apr 3 - 5

TDWI Accelerate Boston 2017
Boston, MA

We’ll be in Boston covering a variety of topics—from running agile data teams, to visual storytelling with data. Let us know if you’ll be there, or sign up to receive all our slides.

Details

2016

Sep 26 - 29

Strata + Hadoop World New York 2016
New York, NY

The SVDS crew will be in New York this year, talking about data platforms, data strategy, and making the business case for Spark. Come by our talks, or catch us in the hallway track.

Details

Mauricio Vacas

Recent Posts

Past Events

TDWI Accelerate Boston 2017

Strata + Hadoop World New York 2016