
LLM Avalanche
At the end of June, I flew out to San Francisco to do three things. I want to break down LLM Avalanche. Aside from being basically a mini-conference that we called a meetup, there were incredible learnings. It would be...
Dear boss, You may have noticed that I have been performing on an elevated level since [date you joined the community]. Some in the organization have even gone as far as to say I am carrying the team. We don't need to get into specifics. That's not what this correspondence is about....
As model complexity increases exponentially, so too does the need for effective MLOps practices. This post acts as a transparent write-up of all the MLOps frustrations I’ve experienced in the last few days. By sharing my challenges and insights, I...
The following is an extract from Andrew McMahon’s book, Machine Learning Engineering with Python, Second Edition. Available on Amazon at https://packt.link/w3JKL. Given the rise in interest in LLMs recently, there has been no shortage of people expressing the desire to integrate...
Programmers have always been passionate about their preferences, whether they discuss spaces vs. tabs, Vim vs. Emacs, or light mode vs. dark mode. These debates have withstood the test of time, indicating that there is a place for each solution, and no definitive...
And logging the results in an experiment-tracking tool. In this article, we explore one of the most popular tools for visualizing the core distinguishing feature of transformer architectures: the attention mechanism. Keep reading to learn more about BertViz and how...
Generative AI (GenAI) is having a moment. In just the past few months, diffusion and large language models have revolutionized the field of machine learning. From creating realistic images to generating human-like text, not a month goes by where there...
Introduction As a recent engineering graduate within a small but ambitious data science team, I was determined to increase our productivity and dive into the world of cloud technologies and DataOps/MLOps practices. Eager to learn and grow, I saw this...
Language models are powerful artificial intelligence algorithms that have the ability to generate human-like text based on the input they receive. They are general-purpose neural networks pre-trained on vast amounts of textual data and learn the statistical patterns and relationships...
Traditional NLP models are trainable, deterministic, and for some of them, explainable. When we encounter an erroneous prediction that affects downstream tasks, we can trace it back to the model, rerun the inference step, and reproduce the same result. We...