How I cut multi-agent LLM costs by 70% using prompt caching, artifact minification, and smarter execution order — and what the AWAF assessment found along the way.
Your AI Agents Are Leaking Tokens, Are You Aware?
A collection of 3 posts
How I cut multi-agent LLM costs by 70% using prompt caching, artifact minification, and smarter execution order — and what the AWAF assessment found along the way.
I built an agentic system to help anyone feeling the labor market shift think clearly about what comes next. Then I scored it against my own framework. It got a 52
Every team is bolting agentic AI onto everything right now. It's giving pure Agent Smith energy. And we're about to pay the same price we paid with microservices unless we build a framework to stop it.