Why is Amazon looking for cheaper AI models?

According to The Information, Amazon is weighing the use of OpenAI's models and its own Nova models in some products to cut costs after Anthropic raised the price of its Claude models. It is a cost decision: match the model to the job rather than run everything through the most expensive option.

How does AI model pricing work?

Most providers charge by the token, roughly a few characters of text, for both the prompt you send and the answer you get back. Prices vary enormously between models, often 10x or 20x, so flagship models cost far more than smaller ones for the same task.

What is the most common mistake businesses make with AI cost?

Optimizing the model before fixing the process, and running every task through the most expensive model. Map the workflow first, separate routine volume from genuine exceptions, then route each step to the appropriate model.

Which AI models should a small business consider?

The major families are Anthropic's Claude, OpenAI's GPT, and Google's Gemini, plus Amazon's lower-cost Nova models and open-weight options. Each family has budget and premium tiers, and most businesses should use both rather than picking a single model for everything.

← ALL ARTICLES29 June 2026

Even Amazon Is Shopping for Cheaper AI. What That Means for Your Business

Why is Amazon suddenly looking for cheaper AI?

According to reporting from The Information, Amazon is weighing whether to use OpenAI's models and its own in-house Nova models inside some Amazon products, partly to cut costs after Anthropic raised the price of using its Claude models. In plain terms: one of the largest, most sophisticated technology buyers on earth looked at its AI bill, decided it was too high, and started shopping around. If Amazon is doing that, the lesson for a small or mid business is simple. You are allowed to do it too.

The headline matters because it kills a myth. Many owners assume AI is a single product you either buy or you do not. It is not. It is a market of competing models at wildly different prices, and the right answer is almost never "pick one and run everything through it."

How does AI model pricing actually work?

Most AI providers charge by the token, which is roughly a few characters of text. You pay for what goes in (your prompt) and what comes out (the answer). The catch is that prices between models differ by a lot, sometimes 10x or 20x for the same task.

The big, flagship models (the ones that top the leaderboards) are the expensive ones. They are genuinely better at hard reasoning, nuanced writing, and tricky judgment calls. But most of the work a business actually feeds an AI is not hard. Sorting an email into a category, pulling a phone number off an invoice, drafting a first-pass reply, tagging a support ticket. That work does not need the most powerful model on the market. It needs a cheap, fast one that gets it right.

This is exactly the move Amazon is reportedly making. It is not abandoning quality. It is matching the model to the job, and keeping the premium model for the work that truly needs it.

What is model routing, and why should you care?

Routing means sending each task to the cheapest model that can do it well, and only escalating to an expensive model when the job warrants it. Think of it like staffing. You do not put your most senior person on every email. You let junior staff handle the routine volume and bring in the expert for the calls that matter.

Here is the pattern in practice. A customer message comes in. A cheap model reads it, classifies it, and drafts a reply for the easy 80 percent. Anything ambiguous, sensitive, or high-value gets flagged and handed to a stronger model, or to a human. You pay premium prices only for the small slice of output that actually ships to a customer.

In our experience helping businesses put AI to practical use, this single decision is where most of the savings live. When someone tells us "AI is too expensive," it is almost never an AI problem. It is a routing problem. They wired every task, no matter how trivial, into the most expensive model available and then got surprised by the bill.

Which AI models should a small business actually consider?

You do not need to memorize a catalogue, but it helps to know the shape of the market. The major families are Anthropic's Claude, OpenAI's GPT models, and Google's Gemini, each offering a tiered range from small and cheap to large and powerful. Amazon's Nova models are its own lower-cost option. Open-weight models you can run more cheaply also exist for teams with the technical appetite.

The practical takeaway is not "which brand wins." It is that within every family there is a budget tier and a premium tier, and you should be using both. Avoid lock-in where you can. The reason Amazon can even consider switching is that its systems were not welded to one provider. Build the same flexibility, even loosely, and price increases become a shrug instead of a crisis.

What is the common mistake businesses make here?

The biggest one is optimizing the model before fixing the process. Swapping to a cheaper model on a workflow that is poorly designed just makes a bad process cheaper to run badly. Map the workflow first. Figure out which steps are genuinely high-stakes and which are routine volume. Then route accordingly.

The second mistake is chasing the leaderboard. The model that ranks first this month is rarely the one your invoice-tagging task needs. Novelty is not the goal. Boring, measurable results are: time saved, errors avoided, faster responses. Measure those, not the brand on the box.

What should you do about it?

You do not need a big project. Pick one task you already run through AI, or one repetitive job you have been meaning to automate. Ask two questions. Does this step genuinely need the smartest model, or would a cheaper one do it just as well? And is there a clean line between the routine 80 percent and the exceptions that need a human or a premium model?

If you can answer those, you can usually cut your AI cost meaningfully without touching quality, the same logic Amazon is reportedly applying at its own scale. Start with one workflow that visibly pays for itself, prove it, then compound. That beats a big-bang overhaul every time.

Want this handled for you?

Odyssey builds AI-powered automation for Australian businesses. We map the workflow, build the system, and keep it running.

GET A FREE AUDIT →