Coffee Sessions #23

SRE for ML Infra

Demetrios isn't sure how it happened, how he found Todd, or how he even convinced him, but we had none other than the head of SRE for ML infrastructure at Google come on the Coffee Sessions to talk all things technical, managerial, and ethical around ML these days. Oh yeah, and we know there are a few out there who will be happy to know we got Vishnu to join us again. (Yes, he already has a fan club!)

Favorite quote: "Bad things happen to good data." He couldn't have been more right about that one. Two main points he made kept us thinking and prompted more questions: the idea of ML "trustworthiness," and the claim that those of us interested in MLOps right now are either going to be the ones who write the books on it or the first ones to read 'em!

In this episode

Todd Underwood

Director of Engineering, Google

Todd Underwood is a Director at Google and leads Site Reliability Engineering for Machine Learning (ML SRE). He is also Site Lead for Google's Pittsburgh office. ML SRE teams build and scale internal and external ML services and are critical to almost every Product Area at Google. Before working at Google, Todd held a variety of roles at Renesys. He was in charge of operations, security, and peering for Renesys's Internet intelligence services, which are now part of Oracle's Cloud service. He also did product work on some early social products that Renesys developed. Before that, Todd was Chief Technology Officer of Oso Grande, an independent Internet service provider (AS2901) in New Mexico.

@tmu

LinkedIn

Demetrios Brinkmann

Host

Demetrios is one of the main organizers of the MLOps community and currently resides in a small town outside Frankfurt, Germany. He is an avid traveller who taught English as a second language to see the world and learn about new cultures. Demetrios fell into the Machine Learning Operations world and has since interviewed the leading names in MLOps, Data Science, and ML. Since diving into the nitty-gritty of Machine Learning Operations, he has felt a strong calling to explore the ethical issues surrounding ML. When he is not conducting interviews, you can find him stacking stones with his daughter in the woods or playing the ukulele by the campfire.

Vishnu Rachakonda

Host

Vishnu Rachakonda is the operations lead for the MLOps Community and co-hosts the MLOps Coffee Sessions podcast. He is a machine learning engineer at Tesseract Health, a 4Catalyzer company focused on retinal imaging. In this role, he builds machine learning models for clinical workflow augmentation and diagnostics in on-device and cloud use cases. Since studying bioengineering at Penn, Vishnu has worked in computational biomedicine and MLOps. In his spare time, Vishnu enjoys suspending all logic to watch Indian action movies, playing chess, and writing.