Zach Musgrave
Zach leads development for Dolt, the world's first SQL database that you can fork and clone, branch and merge, push and pull just like a git repository. Zach studied computer science at the University of Washington, and spent the first 13 years of his career split between Amazon and Google before joining DoltHub. He's a fierce advocate for the value of client-side software in a server-side world.
Sessions
Version control for data is a critical but underserved component of modern data science. Besides giving you reproducibility, true data version control enables diffs and collaboration. In this talk, we'll introduce Dolt, a free and open source version controlled database modeled after git, and demonstrate how you can use it to add reproducibility and the other benefits of data version control to your data pipeline. This talk is for data scientists and engineers, especially those who write scripts to automate pipelines. Working knowledge of git is very helpful but not required.