Cloudflare Workers AI Pricing 2026
Complete pricing guide with plans, hidden costs, and cost analysis
Cloudflare Workers AI pricing ranges from $0 to $4.88 per million tokens.
Cloudflare Workers AI costs from free to $4.88 per million tokens as of May 2026, with 3 plans available, including a free tier. Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.
- Free tier: Yes
Cloudflare Workers AI offers 3 pricing tiers: Free tier, Pay-as-you-go (Neurons / tokens), and Enterprise. The Pay-as-you-go (Neurons / tokens) plan is designed for latency-sensitive inference at the edge.
Compared to other LLM API providers, Cloudflare Workers AI is positioned at the budget-friendly price point.
- 1 documented hidden cost beyond list price
How much does Cloudflare Workers AI cost?
Cloudflare Workers AI Pricing Overview
Cloudflare Workers AI has 3 pricing plans, including a free tier. Per-model rates range from $0 to $4.88 per million tokens. The Free tier plan is free and is best for prototyping and low-volume production at the edge. The Pay-as-you-go (Neurons / tokens) plan is metered at $0.011 per 1,000 Neurons beyond the free daily allocation and is designed for latency-sensitive inference at the edge. The Enterprise plan requires contacting sales for a custom quote and is designed for high-volume production at the edge.
There is at least 1 documented hidden cost beyond Cloudflare Workers AI's list price, in categories such as implementation, training, and add-on fees.
This pricing was last verified on May 19, 2026, from 1 independent source.
Cloudflare Workers AI pricing starts at $0/month on the Free tier, which includes a daily Neuron allocation for AI inference at the edge. For workloads that exceed the free quota, the Pay-as-you-go (Neurons / tokens) plan charges based on actual compute consumed with no monthly minimum. Enterprise pricing is custom-quoted for organizations requiring dedicated capacity, guaranteed throughput, or enterprise SLAs.
All Cloudflare Workers AI Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| Free tier | Free | Free | Prototyping + low-volume production at the edge |
| Pay-as-you-go (Neurons / tokens) | Usage-based | Usage-based | Latency-sensitive inference at the edge |
| Enterprise | Contact Sales | Contact Sales | High-volume production at the edge |
Free tier
- 10,000 Neurons/day free allocation
- Included in both Workers Free and Workers Paid plans
- Access to all hosted open-source models
- Resets daily at 00:00 UTC
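Because the free allocation resets at 00:00 UTC, a client that has used up its free Neurons may want to know how long it is until the next reset. The following is a minimal sketch in Python that assumes only the reset time stated above; the function name `seconds_until_reset` is hypothetical and is not part of any Cloudflare API.

```python
from datetime import datetime, timedelta, timezone


def seconds_until_reset(now: datetime) -> float:
    """Seconds from `now` until the next 00:00 UTC.

    `now` should be timezone-aware; a naive datetime is
    interpreted by Python as local time.
    """
    now_utc = now.astimezone(timezone.utc)
    # Move to tomorrow, then truncate to midnight UTC.
    next_midnight = (now_utc + timedelta(days=1)).replace(
        hour=0, minute=0, second=0, microsecond=0
    )
    return (next_midnight - now_utc).total_seconds()


if __name__ == "__main__":
    wait = seconds_until_reset(datetime.now(timezone.utc))
    print(f"Free allocation resets in about {wait / 3600:.1f} hours")
```

A call made exactly at 00:00 UTC returns a full day (86,400 seconds), since the sketch always looks ahead to the next reset.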
Pay-as-you-go (Neurons / tokens)
- $0.011 per 1,000 Neurons beyond 10,000/day free allocation
- Llama 3.3 70B: $0.293/1M input, $2.253/1M output
- Llama 3.1 8B (fp8): $0.152/1M input, $0.287/1M output
- Mistral 7B: $0.110/1M input, $0.190/1M output
- DeepSeek R1 Distill 32B: $0.497/1M input, $4.881/1M output
- GPT-OSS 120B: $0.350/1M input, $0.750/1M output
- No cold-starts (models pre-loaded at edge PoPs)
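The Neuron rate and the free allocation listed here combine into a simple daily overage calculation. The sketch below is illustrative only: it assumes the 10,000 Neurons/day allocation and the $0.011 per 1,000 Neurons rate stated on this page, the function name `daily_overage_cost` is hypothetical, and actual invoices may round or aggregate differently.

```python
FREE_NEURONS_PER_DAY = 10_000   # free daily allocation stated on this page
USD_PER_1K_NEURONS = 0.011      # pay-as-you-go rate stated on this page


def daily_overage_cost(neurons_used: int) -> float:
    """Estimated metered cost in USD for one day's Neuron usage."""
    # Only Neurons beyond the free allocation are billable.
    billable = max(0, neurons_used - FREE_NEURONS_PER_DAY)
    return billable / 1000 * USD_PER_1K_NEURONS


if __name__ == "__main__":
    for used in (8_000, 10_000, 250_000):
        print(f"{used:>8} Neurons -> ${daily_overage_cost(used):.4f}")
```

For example, 250,000 Neurons in one day leaves 240,000 billable Neurons, or about $2.64 under these assumptions.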
Enterprise
- Committed-use discounts
- Dedicated edge capacity
- SLAs
Usage-Based Rates
Per-unit pricing for Cloudflare Workers AI API usage.
Pay-as-you-go (Neurons / tokens)
| Model | Context | Input | Output | Cached | Per |
|---|---|---|---|---|---|
| llama-3-2-1b-instruct | 131K | $0.027 | $0.201 | — | 1M tokens |
| llama-3-2-3b-instruct | 131K | $0.051 | $0.335 | — | 1M tokens |
| llama-3-1-8b-instruct-fp8-fast | 131K | $0.045 | $0.384 | — | 1M tokens |
| llama-3-1-8b-instruct-fp8 | 131K | $0.152 | $0.287 | — | 1M tokens |
| llama-3-1-8b-instruct | 131K | $0.282 | $0.827 | — | 1M tokens |
| llama-3-2-11b-vision-instruct | 131K | $0.049 | $0.676 | — | 1M tokens |
| llama-3-1-70b-instruct-fp8-fast | 131K | $0.293 | $2.25 | — | 1M tokens |
| llama-3-3-70b-instruct-fp8-fast | 131K | $0.293 | $2.25 | — | 1M tokens |
| deepseek-r1-distill-qwen-32b | 80K | $0.497 | $4.88 | — | 1M tokens |
| mistral-7b-instruct-v0-1 | 8K | $0.110 | $0.190 | — | 1M tokens |
| mistral-small-3-1-24b-instruct | 131K | $0.351 | $0.555 | — | 1M tokens |
| llama-4-scout-17b-16e-instruct | 10M | $0.270 | $0.850 | — | 1M tokens |
| gemma-3-12b-it | 131K | $0.345 | $0.556 | — | 1M tokens |
| qwq-32b | 131K | $0.660 | $1.00 | — | 1M tokens |
| qwen2-5-coder-32b-instruct | 33K | $0.660 | $1.00 | — | 1M tokens |
| qwen3-30b-a3b-fp8 | 262K | $0.051 | $0.335 | — | 1M tokens |
| gpt-oss-120b | 131K | $0.350 | $0.750 | — | 1M tokens |
| gpt-oss-20b | 131K | $0.200 | $0.300 | — | 1M tokens |
| kimi-k2-5 | 262K | $0.600 | $3.00 | $0.100 | 1M tokens |
| kimi-k2-6 | 262K | $0.950 | $4.00 | $0.160 | 1M tokens |
| nemotron-3-120b-a12b | 131K | $0.500 | $1.50 | — | 1M tokens |
- All billing converts to Neurons at $0.011 per 1,000 Neurons behind the scenes
- Neurons = AI-specific compute units representing GPU compute per request
- Free allocation of 10,000 Neurons/day applies before metered usage
- Embeddings, image, and audio models priced separately (not shown — different units)
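To turn the per-token rates in the table into a workload estimate, multiply token counts by the input and output rates. The sketch below uses a small subset of the listed rates; the helper names `token_cost` and `implied_neurons` are hypothetical. The Neuron figure is back-calculated from the $0.011 per 1,000 Neurons rate and is an assumption for illustration, not Cloudflare's official per-model Neuron mapping.

```python
# USD per 1M tokens as (input, output), copied from the rate table on this page.
RATES = {
    "llama-3-3-70b-instruct-fp8-fast": (0.293, 2.25),
    "llama-3-1-8b-instruct-fp8": (0.152, 0.287),
    "gpt-oss-120b": (0.350, 0.750),
}
USD_PER_1K_NEURONS = 0.011  # rate stated on this page


def token_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost for a workload, before any free allocation."""
    in_rate, out_rate = RATES[model]
    return input_tokens / 1e6 * in_rate + output_tokens / 1e6 * out_rate


def implied_neurons(cost_usd: float) -> float:
    """Neurons implied by a USD amount at the stated Neuron rate (assumption)."""
    return cost_usd / USD_PER_1K_NEURONS * 1000


if __name__ == "__main__":
    cost = token_cost("gpt-oss-120b", 2_000_000, 1_000_000)
    print(f"Estimated cost: ${cost:.2f}")
    print(f"Implied Neurons: {implied_neurons(cost):,.0f}")
```

Output tokens dominate the bill for most of the listed models, so estimating the expected response length matters more than the prompt length when comparing models.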
How Cloudflare Workers AI Pricing Compares
| Software | Starting Price | Top Price |
|---|---|---|
| Cloudflare Workers AI | Free | $4.881 per million tokens |
| Amazon Bedrock | $0.07 per million tokens | $75 per million tokens |
| Anyscale | $0.15 per million tokens | $5 per million tokens |
| Baidu ERNIE API | $0.10 per million tokens | $10 per million tokens |
| Cerebras Inference API | $0.10 per million tokens | $6 per million tokens |
| Claude API | $0.03 per million tokens | $75 per million tokens |
How to Negotiate Cloudflare Workers AI Pricing
Cloudflare Workers AI contracts are negotiable. This tactic is sourced from real buyer experiences and procurement specialists.
Cloudflare enterprise sales representatives are reported to immediately offer 25% off list price. For Cloudflare Enterprise services, list pricing has been cited around $260/month, with volume discounts bringing costs closer to $200/month. Do not accept the first offer — open by asking for a volume discount before any concessions are made.
Source: Reddit
Cloudflare Workers AI Pricing FAQ
01 Does Cloudflare Workers AI have a free tier?
Yes. Cloudflare Workers AI includes a Free tier at $0/month. Beyond free-tier limits, usage is billed on a Pay-as-you-go basis priced in Neurons — Cloudflare's unit for AI compute. Enterprise pricing is custom-quoted for teams needing guaranteed capacity or SLAs.
02 What happens if I exceed the free tier on Cloudflare Workers AI?
You move to the Pay-as-you-go (Neurons / tokens) plan, which charges based on actual inference usage. Note that there are reports of undocumented usage thresholds beyond which Cloudflare may push users toward an Enterprise contract.