November 13–15, 2018 - Shanghai, China
Click Here For Information & Registration

To view the Chinese version of this schedule please go here.

Simultaneous translation will be provided for all keynote and breakout sessions.
Thursday, November 15 • 11:30 - 12:05
Modern Data Science in a Cloud Native World - Samuel Kreter, Microsoft

Sign up or log in to save this to your schedule and see who's attending!

Feedback form is now closed.
We now live in 2018, where the meaning of Big Data keeps getting bigger. Yet, the tools most people are using with their data requires a huge amount of experience to understand and scale. We are also facing a time where it is necessary to track the flow of data for better understanding and compliance with GDPR.

I am going to walk through how to take advantage of Kuberentes and other Cloud Native technologies with the open source project Pachyderm to create data science pipelines that are easy to develop, test, deploy and scale. I will also cover how to use Data Versioning throughout the process to track data changes and understand exactly how your data is changing.

Talk Outline:
1. Introduce the basic concepts of Data Pipelines and Versioning.
2. Create and test a simple model.
4. Scale it up to a production sized workload and automatically have changes deployed in the pipeline.

avatar for Samuel Kreter

Samuel Kreter

Software Engineer 软件工程师, Microsoft
Sam Kreter is a software engineer at Microsoft working on the Cloud Native Compute Team focused on Azure Container Instances. Previously, he worked with an SOS Venture incubator company out of Shanghai, China developing a Bitcoin transferring technology. He also worked as a research... Read More →

Thursday November 15, 2018 11:30 - 12:05
2F Room 1