Coffee Sessions #99

Getting the Most Out of your AI Infrastructure

Run:ai is developing a cloud-based platform for building with AI. In this talk, we hear why this need exists, how the platform works, and what value it creates.

Take-aways

- Why Kubernetes isn't built for AI workloads: a discussion of the challenges of running AI workloads on Kubernetes, specifically around queuing, GPUs, etc. Some of the topics:
  - Lack of proper GPU support
  - Lack of multiple queues
  - Lack of advanced scheduling like batch and gang scheduling
- Hot take: Kubernetes is not built for AI

In this episode

Ronen Dar

CTO, Run:ai

Run:ai co-founder and CTO Ronen Dar was previously a research scientist at Bell Labs and has worked at Apple and Intel in multiple R&D roles. As CTO, Ronen manages research and the product roadmap for Run:ai, a startup he co-founded in 2018. Ronen is the co-author of many patents in the fields of storage, coding, and compression. Ronen received his B.S., M.S., and Ph.D. degrees from Tel Aviv University.

Gijsbert Janssen van Doorn

Director Technical Product Marketing, Run:ai

Gijsbert Janssen van Doorn is Director of Technical Product Marketing at Run:ai. He is a passionate advocate for technology that will shape the future of how organizations run AI. Gijsbert comes from a technical engineering background, with six years in multiple roles at Zerto, a Cloud Data Management and Protection vendor.

Demetrios Brinkmann

Host

Demetrios is one of the main organizers of the MLOps community and currently resides in a small town outside Frankfurt, Germany. He is an avid traveller who taught English as a second language to see the world and learn about new cultures. Demetrios fell into the Machine Learning Operations world and has since interviewed the leading names in MLOps, Data Science, and ML. Since diving into the nitty-gritty of Machine Learning Operations, he has felt a strong calling to explore the ethical issues surrounding ML. When he is not conducting interviews, you can find him stacking stones with his daughter in the woods or playing the ukulele by the campfire.

Vishnu Rachakonda

Host

Vishnu Rachakonda is the operations lead for the MLOps Community and co-hosts the MLOps Coffee Sessions podcast. He is a machine learning engineer at Tesseract Health, a 4Catalyzer company focused on retinal imaging. In this role, he builds machine learning models for clinical workflow augmentation and diagnostics in on-device and cloud use cases. Since studying bioengineering at Penn, Vishnu has been actively working in the fields of computational biomedicine and MLOps. In his spare time, Vishnu enjoys suspending all logic to watch Indian action movies, playing chess, and writing.