Driving ML Data Quality with Data Contracts
Andrew introduces the concept of Data Contracts and talks about how they at GoCardless are using it to improve the quality and reliability of data by empowering data consumers - including our Data Scientists - to work closely with the data generators and get the data they really need to power highly effective ML models and other data-driven products.
In this episode
Andrew Jones
Tech Lead, GoCardless
Andrew is a Senior Data Engineer and group Tech Lead, working across Data Infrastructure and ML Enablement to build best-in-class infrastructure and services to power analytics, models, and data-driven products.
Demetrios Brinkmann
Host
Demetrios is one of the main organizers of the MLOps community and currently resides in a small town outside Frankfurt, Germany. He is an avid traveller who taught English as a second language to see the world and learn about new cultures. Demetrios fell into the Machine Learning Operations world, and since, has interviewed the leading names around MLOps, Data Science, and ML. Since diving into the nitty-gritty of Machine Learning Operations he felt a strong calling to explore the ethical issues surrounding ML. When he is not conducting interviews you can find him making stone stacking with his daughter in the woods or playing the ukulele by the campfire.
Ben Epstein
Host
Ben was the machine learning lead for Splice Machine, leading the development of their MLOps platform and Feature Store. He is now a founding software engineer at Galileo (rungalileo.io) focused on building data discovery and data quality tooling for machine learning teams. Ben also works as an adjunct professor at Washington University in St. Louis teaching concepts in cloud computing and big data analytics.