How I cut multi-agent LLM costs by 70% using prompt caching, artifact minification, and smarter execution order — and what the AWAF assessment found along the way.
Your AI Agents Are Leaking Tokens, Are You Aware?
A collection of 46 posts
How I cut multi-agent LLM costs by 70% using prompt caching, artifact minification, and smarter execution order — and what the AWAF assessment found along the way.
I built an agentic system to help anyone feeling the labor market shift think clearly about what comes next. Then I scored it against my own framework. It got a 52
Every team is bolting agentic AI onto everything right now. It's giving pure Agent Smith energy. And we're about to pay the same price we paid with microservices unless we build a framework to stop it.
Agent talk is everywhere. In client meetings and hallway conversations, the theme keeps coming up. Social media can make it feel like agents will soon be serving your next coffee. Maybe one day. Today, we are not there yet. Over…
Are you familiar with the Well-Architected Framework (WAF)? It's an essential tool for ensuring that your cloud computing infrastructure is secure, efficient, reliable, and cost-effective. Neglecting to use of the WAF can result in a range of issues,…