Skip to main content

Posts

Showing posts from February, 2026

Why We Built Our Multi-Agent System on Kafka (And What We Learned)

The 3:47am Incident That Changed Our Architecture At 3:47am on a Tuesday, our monitoring dashboard lit up. Three different teams had just published the same article about agent observability. Marketing wrote it for the corporate blog. Sales adapted it for a prospect deck. Content produced it for our technical newsletter. All three versions were good. All three were complete. All three appeared within 20 minutes of each other. The problem? Nobody knew the other teams were working on it. This wasn’t a coordination failure. It was an architecture failure. Our multi-agent system had no shared truth. Each department’s agent operated independently, pulling from the same source material, generating similar content, with zero awareness of parallel work. That morning, we rebuilt our agent communication architecture on Kafka. Here’s why, what we learned, and the patterns that emerged from six months in production. The Architecture That Failed Our original multi-agent system looked clean...

Building a Model-Agnostic Multi-Agent System with OpenClaw

Over one week we rebuilt our AI stack around OpenClaw’s multi-agent architecture to avoid provider lock-in and stop wasting premium tokens. By aligning models to tasks, diversifying fallbacks across providers, enforcing minimal tool access, and switching to memory-first workflows with ephemeral sessions, we reduced token usage per task by about 70% and cut our monthly bill by 77% while improving operational resilience. How We Achieved 77% Cost Reduction and Provider Independence Over the past week, we rebuilt our AI infrastructure around OpenClaw’s multi-agent architecture. The result was a 77% cost reduction , provider independence , and a delegation system that routes work to the most cost-effective model for each job. Below is the technical journey of optimizing a 7-agent squad with OpenClaw. The Challenge: Model Provider Lock-In We started with a simple problem: our entire squad defaulted to a single model provider. This created three issues: Cost inefficiency beca...

How I built a secure, high-performance AI agent squad with OpenClaw

The short version: We run PaxMachina like an Airflow-style DAG, separating heavy lifting from reasoning to save tokens. We replaced generic vector stores with a specialized Query-Memory-Document (QMD) backend for high-velocity state. We treat Telegram channels as immutable event logs, not watercoolers. And we added a task ledger protocol that prevents the runaway loops plaguing other agent frameworks. AI agents are like Airflow for intelligence I used to think the bottleneck in agent systems was model intelligence. I was wrong. The bottleneck is context hygiene . If you treat an agent like a chatty intern, you burn tokens on coordination and lose state in the noise. The shift that made our system (PaxMachina) work was treating it like an ops pipeline. Specifically, like Airflow DAGs . We separated the "muscle" (gathering data) from the "brain" (reasoning), and we locked down how they talk to each other. If you've followed the recent OpenClaw ...