OpenAI is slashing token prices. GPT 5.6 arrives end of June. Fable 5 costs $10/$50. Here's what the price war means for teams paying API bills — with full pricing tables and cost-per-task data.
Key takeaways:
- OpenAI confirmed "weighing significant token price cuts" — WSJ, June 10
- GPT 5.6 expected end of June — positioned to undercut Fable 5 on price
- Current gap: GPT-5.5 at $5/$30 vs Fable 5 at $10/$50 per million tokens
- OpenAI burning $27B in 2026 cash — price cuts are survival, not generosity
- Enterprises pulling back: Uber blew its AI budget by April, usage down 20-30% at some firms
The Wall Street Journal reported on June 10 that OpenAI is considering "drastic" token price reductions. Two days earlier, Anthropic launched Claude Fable 5 — the most expensive public model either company has released. Both are preparing for IPOs this year.
This is not a gift to developers. It's a survival war. Here's what it actually means for your budget.
Full API pricing as of June 12, 2026:
| Model (Provider) | Input | Output | Context | vs Fable 5 |
|---|---|---|---|---|
| Claude Fable 5 (Anthropic) | $10.00 | $50.00 | 1M | baseline |
| GPT-5.5 Pro (OpenAI) | $15.00 | $60.00 | 128K | 1.2x output |
| Claude Fable 5 cached (Anthropic) | $2.50 | $50.00 | 1M | 4x cheaper input |
| GPT-5.5 (OpenAI) | $5.00 | $30.00 | 128K | 2x cheaper |
| Claude Opus 4.8 (Anthropic) | $5.00 | $25.00 | 200K | 2x cheaper |
| Claude Sonnet 4.6 (Anthropic) | $3.00 | $15.00 | 200K | 3x cheaper |
| GPT-5.4 (OpenAI) | $2.50 | $15.00 | 128K | 4x cheaper |
| Gemini 3.1 Pro (Google) | $2.00 | $12.00 | 2M | 4x cheaper |
| Claude Haiku 4.5 (Anthropic) | $1.00 | $5.00 | 200K | 10x cheaper |
| GPT-5.4 Nano (OpenAI) | $0.20 | $1.25 | 128K | 40x cheaper |
GPT 5.6 (expected end of June): Pricing unconfirmed. Based on OpenAI's stated intent to compete with Fable 5, estimated $5-8 input / $25-40 output.
Uber blew their entire 2026 AI budget by April. Monthly costs per engineer hit $500-$2,000. Some firms report token usage down 20-30% as companies impose hard limits. And that was on Opus 4.8 — Fable 5 is 2x the price.
Here's what common engineering tasks actually cost:
| Task | Tokens (in/out) | Fable 5 ($10/$50) | GPT-5.5 ($5/$30) | Opus 4.8 ($5/$25) | Sonnet 4.6 ($3/$15) |
|---|---|---|---|---|---|
| Code review (500 LOC) | 8K / 2K | $0.18 | $0.10 | $0.09 | $0.05 |
| Bug fix + explanation | 5K / 4K | $0.25 | $0.15 | $0.13 | $0.08 |
| Feature implementation | 10K / 8K | $0.50 | $0.29 | $0.25 | $0.15 |
| Full file refactor | 15K / 12K | $0.75 | $0.44 | $0.38 | $0.23 |
| 1-hour agent loop | 50K / 200K | $10.50 | $6.25 | $5.25 | $3.15 |
| Multi-day autonomous task | 500K / 2M | $105.00 | $62.50 | $52.50 | $31.50 |
For agent loops running hours or days, the gap compounds to hundreds of dollars per session. One developer reported R$2,200 (~$400) for a 50-minute code audit using 96 parallel agents on Fable 5.
Yesterday's flagship gets cheaper. Today's frontier gets more expensive:
| Model Generation | Year | Input / 1M | Output / 1M | Change |
|---|---|---|---|---|
| GPT-4 | 2023 | $30.00 | $60.00 | baseline |
| Claude Opus 4.0 | 2024 | $15.00 | $75.00 | — |
| GPT-4o | 2024 | $2.50 | $10.00 | -92% vs GPT-4 |
| Claude Opus 4.5/4.6 | 2025 | $5.00 | $25.00 | -66.7% vs Opus 4.0 |
| GPT-5.4 | 2025 | $2.50 | $15.00 | -92% vs GPT-4 |
| GPT-5.5 | Apr 2026 | $5.00 | $30.00 | +100% vs GPT-5.4 |
| Claude Fable 5 | Jun 2026 | $10.00 | $50.00 | +100% vs Opus 4.6 |
Both GPT-5.5 and Fable 5 doubled their predecessor's price. The "price cuts" OpenAI is considering will apply to older models — not the newest frontier ones.
$200/month Claude Max subscriptions are burning out in 30-90 minutes on Fable 5. OpenRouter found ChatGPT Pro 20x delivers ~$14,000 in compute for $200 — a subsidy the WSJ reports is unsustainable.
| Plan | Monthly Cost | Agent Credits | Break-even (Fable) | Reality |
|---|---|---|---|---|
| Claude Pro | $20/mo | $20 | ~12 min heavy use | Burns in one session |
| Claude Max 5x | $100/mo | $100 | ~60 min heavy use | Gone in 1-2 hours |
| Claude Max 20x | $200/mo | $200 | ~90 min heavy use | 30-90 min reported |
| ChatGPT Pro 20x | $200/mo | ~$14,000 compute | Massively subsidized | Not sustainable |
| Direct API | Pay-per-use | Unlimited | Full cost, full control | Real visibility |
Critical dates: June 15 — Agent SDK billing splits to separate pool. June 22 — Fable free access ends. End of June — GPT 5.6 expected.
| Model | SWE-Bench Pro | Output / 1M | Score Per $1 | Rank |
|---|---|---|---|---|
| Claude Sonnet 4.6 | ~55% (est.) | $15.00 | 3.7 | 1st |
| Claude Opus 4.8 | 69.2% | $25.00 | 2.8 | 2nd |
| GPT-5.5 | 58.6% | $30.00 | 2.0 | 3rd |
| Claude Fable 5 | 80.3% | $50.00 | 1.6 | 4th |
Fable 5 is the most capable but worst value per dollar. Sonnet 4.6 delivers the best benchmark score per dollar spent. Opus 4.8 is the sweet spot for teams that need quality without Fable pricing.
Anthropic forced their hand. Fable 5 scores 80.3% on SWE-Bench Pro vs GPT-5.5's 58.6%. OpenAI can't win on benchmarks yet, so they compete on price.
Enterprises are pulling back. Altman acknowledged costs are "a huge issue" for corporate clients. Some tools show token usage down 20-30% as companies restrict access.
IPO pressure. OpenAI projects $27 billion in 2026 cash burn and won't profit until 2029. Anthropic's Series H valued it at $965B. Both need user growth for their roadshows — price cuts drive adoption.
On June 13, the US government issued an export control directive suspending all Fable 5 and Mythos 5 access for foreign nationals. Anthropic was forced to disable the model globally to ensure compliance. Every team that went all-in on Fable 5 lost access overnight — zero notice, zero workaround.
This is the vendor lock-in risk nobody priced in. The most powerful model in the world is also the most fragile dependency you can build on. Multi-provider routing is not just about cost optimization. It is business continuity.
Cheaper tokens don't mean smaller bills. Every price cut in history has been followed by higher total spend — because more capable models at lower prices drive more usage.
Track $/task, not $/token. A model at 2x the token price that completes tasks in half the tokens costs the same. Measure cost per successful outcome — read our full guide on reducing AI API costs.
Build provider flexibility. Teams locked into one provider can't capitalize on competitor price drops. Multi-provider routing is insurance against pricing volatility.
Set budget caps before agents scale. Unsupervised agent loops at lower per-token rates burn through budget faster. Lower price × higher volume = same bill.
Watch the June dates. Three pricing events in three weeks — each changes the math. See our full Fable 5 pricing breakdown for what the post-June-22 world looks like.
How much will GPT 5.6 cost per million tokens?
Not released yet (expected end of June 2026). We expect $5-8 input / $25-40 output. This article will be updated with confirmed pricing.
Is OpenAI cutting prices in 2026?
Yes. WSJ confirmed June 10 that OpenAI is "weighing significant token price cuts." Specific amounts unconfirmed.
Which is cheaper: GPT-5.5 or Claude Fable 5?
GPT-5.5 at $5/$30 is 2x cheaper than Fable 5 at $10/$50. But Fable scores 80.3% vs 58.6% on SWE-Bench Pro — so GPT-5.5 may need more iterations on complex tasks.
When does Claude Fable 5 free access end?
June 22, 2026. After that, usage credits are required on top of Pro/Max subscriptions.
This article will be updated when GPT 5.6 launches with confirmed pricing. Last updated: June 12, 2026.
Want to cut your AI costs?
CostLens routes simple prompts to cheaper models automatically.