March 11, 2025

DISTRIBUTED TRAINING IN MLOPS: Accelerate MLOps with Distributed Computing for Scalable Machine Learning
This series explores the potential of distributed MLOps in accelerating AI innovation. From foundational strategies like data and pipeline parallelism to advanced techniques for unifying mixed AMD and NVIDIA GPU clusters, the articles provide insights into building scalable, cost-effective systems.
- Distributed Training: Leveraging frameworks like PyTorch DDP, MPI, and Ray to split workloads across GPUs and nodes, reducing training times from years to days.
- Mixed Hardware Ecosystems: Bridging CUDA and ROCm with UCC/UCX to unify AMD and NVIDIA GPUs, eliminating vendor lock-in and maximizing infrastructure ROI.
- Kubernetes Orchestration: Automating GPU resource allocation, fault tolerance, and gang scheduling with tools like Volcano and Kubeflow for enterprise-scale efficiency.
- Performance Optimization: Techniques like RDMA, NUMA alignment, GPU sharing (MIG/SR-IOV), and collective communication tuning (NCCL/RCCL) to achieve near-linear scaling.
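As a taste of the first topic, here is a minimal sketch of PyTorch DDP, the data-parallel framework named above. It is hedged for a single CPU process (the `gloo` backend with `world_size=1`) so it runs anywhere; under a real `torchrun` launch, the rank, world size, and rendezvous address come from the launcher's environment, and GPU clusters would use the NCCL/RCCL backends mentioned later in the series.

```python
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

# Under torchrun these are set by the launcher; the defaults below are
# illustrative fallbacks for a standalone single-process run.
os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
rank = int(os.environ.get("RANK", 0))
world_size = int(os.environ.get("WORLD_SIZE", 1))

# "gloo" runs on CPU; NVIDIA GPU clusters typically use "nccl"
# (ROCm builds of PyTorch route the same backend name to RCCL).
dist.init_process_group("gloo", rank=rank, world_size=world_size)

# DDP replicates the model on every rank and all-reduces gradients
# during backward(), keeping the replicas in sync.
model = DDP(nn.Linear(8, 1))
opt = torch.optim.SGD(model.parameters(), lr=0.1)

for step in range(3):
    # Each rank would normally see its own shard of the data
    # (e.g. via DistributedSampler); random inputs stand in here.
    loss = model(torch.randn(4, 8)).pow(2).mean()
    opt.zero_grad()
    loss.backward()   # gradient all-reduce happens inside this call
    opt.step()

final_loss = loss.item()
dist.destroy_process_group()
```

Scaling this sketch out is mostly a launcher concern: `torchrun --nproc_per_node=N` starts one process per GPU, and the same script runs unchanged on each rank.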
Whether scaling trillion-parameter models or integrating fragmented infrastructure after a merger, this series describes how teams can transform heterogeneous hardware into a unified collective — driving faster innovation, reducing costs, and future-proofing MLOps pipelines.