Meetup #123

Declarative MLOps - Streamlining Model Serving on Kubernetes

Data Scientists prefer Jupyter Notebooks to experiment and train ML models. Serving these models in production can benefit from a more streamlined approach that can guarantee a repeatable, scalable, and high velocity. Kubernetes provides such an environment. And while third-party solutions for serving models make it easier, this talk demystifies how native K8s operators can be used to deploy models along with best practices for containerizing your own model, and CI/CD using GitOps.

Take-aways

In this talk, you'll learn: - How you can containerize ML models for serving them in 3 ways - batch, real-time with HTTP, and with a message queue. - Continuous Integration: How to use GitHub Actions to build and push these containers to the repo. - Continuous Delivery: How you can use ArgoCD to deploy these to Kubernetes. - About Ingresses & Observability. - About establishing a process around your team to incrementally release models using GitOps best practices. - How moving to Kubernetes enabled AI Hero's "SuperPowers".

In this episode

Rahul Parundekar

Rahul Parundekar

Founder, A.I. Hero, Inc.

Rahul has 13+ years of experience building AI solutions and leading teams. He is passionate about building Artificial Intelligence (A.I.) solutions for improving the Human Experience. He is currently the founder of A.I. Hero - a platform to help you fix and enrich your data with ML. At AI Hero, he has also been a big proponent of declarative MLOps - using Kubernetes to operationalize the training and serving lifecycle of ML models and has published several tutorials on his Medium blog. Before AI Hero, he was the Director of Data Science (ML Engineering) at Figure-Eight (acquired by Appen), a data annotation company, where he built out a data pipeline and ML model serving architecture serving 36 models (NLP, Computer Vision, Audio, etc.) and traffic of up to 1M predictions per day.

Twitter

LinkedIn

Demetrios Brinkmann

Demetrios Brinkmann

Host

Demetrios is one of the main organizers of the MLOps community and currently resides in a small town outside Frankfurt, Germany. He is an avid traveller who taught English as a second language to see the world and learn about new cultures. Demetrios fell into the Machine Learning Operations world, and since, has interviewed the leading names around MLOps, Data Science, and ML. Since diving into the nitty-gritty of Machine Learning Operations he felt a strong calling to explore the ethical issues surrounding ML. When he is not conducting interviews you can find him making stone stacking with his daughter in the woods or playing the ukulele by the campfire.

Ben Epstein

Ben Epstein

Host

Ben was the machine learning lead for Splice Machine, leading the development of their MLOps platform and Feature Store. He is now a founding software engineer at Galileo (rungalileo.io) focused on building data discovery and data quality tooling for machine learning teams. Ben also works as an adjunct professor at Washington University in St. Louis teaching concepts in cloud computing and big data analytics.