, June 21, 2026

Companies Discover GPT-4 Costs Money, Invent Cost-Cutting


Companies are shifting from running everything on the most powerful AI model to matching each task to the right one, a practice called model routing.

  •   2 min reads
Companies Discover GPT-4 Costs Money, Invent Cost-Cutting

Table of content

Model routing is the practice of using cheaper AI models for simple tasks instead of burning through premium tokens like a retail trader discovering options for the first time. Companies looked at their bills from OpenAI and Anthropic. They did what any rational business does when faced with unnecessary expense. They stopped paying for it.

This breakthrough strategy involves asking whether you need the Ferrari AI to tell you what day it is. Turns out you don't. A Honda AI works fine. Revolutionary thinking from the same executives who spent eighteen months claiming AI would transform everything before remembering that profit margins exist.

OpenAI and Anthropic built their entire business model on convincing people that every query requires maximum computational firepower. Send an email? GPT-4. Check the weather? Claude Opus. Calculate a tip? Better consult the $200-per-million-token oracle just to be safe. Companies bought this pitch because nobody wants to be the guy who chose the cheap AI and missed the revolution. Fear-driven purchasing at its finest.

Model routing kills this beautiful arrangement. Some intern at a Fortune 500 company ran the numbers. Ninety percent of their AI queries could run on models that cost one-tenth the price. Management praised this insight as innovative cost optimization. The intern learned an important lesson about corporate life: obvious solutions are considered genius if nobody implemented them yet.

OpenAI and Anthropic will adapt. They'll bundle their models into convenient tiers. They'll offer enterprise discounts that make routing unnecessary. They'll release white papers explaining why model routing actually increases costs when you factor in implementation overhead and strategic synergies. Marketing departments are already workshopping the phrase "integrated model ecosystems" as we speak.

The best part? Companies will spend six months building sophisticated routing infrastructure to save money on AI. They'll hire consultants. They'll form task forces. They'll create routing algorithms that decide which model handles which task. The total cost of this efficiency project will exceed what they would have spent just overpaying for GPT-4 forever. But at least they'll have a dashboard showing the savings.

Nothing says cutting-edge technology like rediscovering that buying exactly what you need costs less than buying everything.

Photo by on Unsplash

Related Posts

If you got something out of this, Phil McCandlestick accepts tips. No pressure — the chart was free.

Leave a Tip