Quick Answer
Last verified:
Medium confidence

Cloudflare Workers AI costs Free to $4.88 per per million tokens as of May 2026, with 3 plans available including a free tier. Plan: Free tier (free). Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.

Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.

  • Free tier: Yes

Cloudflare Workers AI offers 3 pricing tiers: Free tier, Pay-as-you-go (Neurons / tokens), Enterprise. The Pay-as-you-go (Neurons / tokens) plan is latency-sensitive inference at the edge.

Compared to other llm api providers software, Cloudflare Workers AI is positioned at the budget-friendly price point.

  • 1 documented hidden costs beyond list price

How much does Cloudflare Workers AI cost?

Cloudflare Workers AI offers 3 pricing plans, starting with a free tier and scaling to custom enterprise pricing. Plans include Free tier (free), Pay-as-you-go (Neurons / tokens) (custom pricing), Enterprise (custom pricing).

Cloudflare Workers AI Pricing Overview

Cloudflare Workers AI has 3 pricing plans, including a free tier. Paid plans range from $0 to $4.88/per million tokens. The Free tier plan is free and is best for prototyping + low-volume production at the edge. The Pay-as-you-go (Neurons / tokens) plan requires contacting sales for a custom quote and is designed for latency-sensitive inference at the edge. The Enterprise plan requires contacting sales for a custom quote and is designed for high-volume production at the edge.

There are at least 1 documented hidden costs beyond Cloudflare Workers AI's list price, including implementation, training, and add-on fees.

This pricing was last verified in May 19, 2026 from 1 independent sources.

Cloudflare Workers AI pricing starts at $0/month on the Free tier, which includes a daily Neuron allocation for AI inference at the edge. For workloads that exceed the free quota, the Pay-as-you-go (Neurons / tokens) plan charges based on actual compute consumed with no monthly minimum. Enterprise pricing is custom-quoted for organizations requiring dedicated capacity, guaranteed throughput, or enterprise SLAs.

How Cloudflare Workers AI Pricing Compares

Compare Cloudflare Workers AI pricing against top alternatives in LLM API Providers.

All Cloudflare Workers AI Plans & Pricing

Plan Monthly Annual Best For
View all features by plan (compare side-by-side)

Free tier

  • 10,000 Neurons/day free allocation
  • Included in both Workers Free and Workers Paid plans
  • Access to all hosted open-source models
  • Resets daily at 00:00 UTC

Pay-as-you-go (Neurons / tokens)

  • $0.011 per 1,000 Neurons beyond 10,000/day free allocation
  • Llama 3.3 70B: $0.293/1M input, $2.253/1M output
  • Llama 3.1 8B (fp8): $0.152/1M input, $0.287/1M output
  • Mistral 7B: $0.110/1M input, $0.190/1M output
  • DeepSeek R1 Distill 32B: $0.497/1M input, $4.881/1M output
  • GPT-OSS 120B: $0.350/1M input, $0.750/1M output
  • No cold-starts (models pre-loaded at edge PoPs)

Enterprise

  • Committed-use discounts
  • Dedicated edge capacity
  • SLAs
Compare Cloudflare Workers AI with alternativesAdjust seats, lock a tier, add up to 2 more products side-by-side. Shareable URL.

Usage-Based Rates

Per-unit pricing for Cloudflare Workers AI API usage.

Pay-as-you-go (Neurons / tokens)

Model Input Output Cached Per
llama-3-2-1b-instruct 131K ctx $0.027 $0.201 1M tokens
llama-3-2-3b-instruct 131K ctx $0.051 $0.335 1M tokens
llama-3-1-8b-instruct-fp8-fast 131K ctx $0.045 $0.384 1M tokens
llama-3-1-8b-instruct-fp8 131K ctx $0.152 $0.287 1M tokens
llama-3-1-8b-instruct 131K ctx $0.282 $0.827 1M tokens
llama-3-2-11b-vision-instruct 131K ctx $0.049 $0.676 1M tokens
llama-3-1-70b-instruct-fp8-fast 131K ctx $0.293 $2.25 1M tokens
llama-3-3-70b-instruct-fp8-fast 131K ctx $0.293 $2.25 1M tokens
deepseek-r1-distill-qwen-32b 80K ctx $0.497 $4.88 1M tokens
mistral-7b-instruct-v0-1 8K ctx $0.110 $0.190 1M tokens
mistral-small-3-1-24b-instruct 131K ctx $0.351 $0.555 1M tokens
llama-4-scout-17b-16e-instruct 10000K ctx $0.270 $0.850 1M tokens
gemma-3-12b-it 131K ctx $0.345 $0.556 1M tokens
qwq-32b 131K ctx $0.660 $1.00 1M tokens
qwen2-5-coder-32b-instruct 33K ctx $0.660 $1.00 1M tokens
qwen3-30b-a3b-fp8 262K ctx $0.051 $0.335 1M tokens
gpt-oss-120b 131K ctx $0.350 $0.750 1M tokens
gpt-oss-20b 131K ctx $0.200 $0.300 1M tokens
kimi-k2-5 262K ctx $0.600 $3.00 $0.100 1M tokens
kimi-k2-6 262K ctx $0.950 $4.00 $0.160 1M tokens
nemotron-3-120b-a12b 131K ctx $0.500 $1.50 1M tokens
  • All billing converts to Neurons at $0.011 per 1,000 Neurons behind the scenes
  • Neurons = AI-specific compute units representing GPU compute per request
  • Free allocation of 10,000 Neurons/day applies before metered usage
  • Embeddings, image, and audio models priced separately (not shown — different units)

Compare Cloudflare Workers AI vs Alternatives

Before committing to Cloudflare Workers AI, compare pricing with these 3 alternatives in the same category.

All Cloudflare Workers AI alternatives & migration guides

What Companies Actually Pay for Cloudflare Workers AI

Review scores

How Cloudflare Workers AI Pricing Compares

Software Starting Price Top Price
Cloudflare Workers AI Free $4.881/per million tokens
Amazon Bedrock $0.07/per million tokens $75/per million tokens
Anyscale $0.15/per million tokens $5/per million tokens
Baidu ERNIE API $0.1/per million tokens $10/per million tokens
Cerebras Inference API $0.1/per million tokens $6/per million tokens
Claude API $0.03/per million tokens $75/per million tokens

1 Cloudflare Workers AI Hidden Costs Beyond the List Price

Beyond the listed price, Cloudflare Workers AI has at least 1 documented hidden costs that can significantly increase total cost of ownership.

Watch for 1 hidden costs
  • Undocumented Usage Limits Forcing Enterprise Upgrade 5-15% of license costs
    high 1 source
    Reddit "There are undocumented, unknown limits where if you have enough usage, they will try to force you into going to an Enterprise plan."
Tip

Ask your Cloudflare Workers AI sales rep about these costs upfront. Getting them in writing before signing can save you from surprise charges later.

Full hidden costs breakdown →

Intelligence sourced from 1 independent sources
Reddit User discussions
Key claims include inline source attribution. Data verified against multiple independent sources. 2 source citations total.

How to Negotiate Cloudflare Workers AI Pricing

Cloudflare Workers AI contracts are negotiable. These 1 tactics are sourced from real buyer experiences and procurement specialists.

Negotiation Playbook 1 tactics
Expect Immediate Discount Off Enterprise List Price high success

Cloudflare enterprise sales representatives are reported to immediately offer 25% off list price. For Cloudflare Enterprise services, list pricing has been cited around $260/month, with volume discounts bringing costs closer to $200/month. Do not accept the first offer — open by asking for a volume discount before any concessions are made.

reddit

Full negotiation guide →

Cloudflare Workers AI Pricing FAQ

01 Does Cloudflare Workers AI have a free tier?

Yes. Cloudflare Workers AI includes a Free tier at $0/month. Beyond free-tier limits, usage is billed on a Pay-as-you-go basis priced in Neurons — Cloudflare's unit for AI compute. Enterprise pricing is custom-quoted for teams needing guaranteed capacity or SLAs.

02 What happens if I exceed the free tier on Cloudflare Workers AI?

You move to the Pay-as-you-go (Neurons / tokens) plan, which charges based on actual inference usage. Note that Cloudflare has reported undocumented usage thresholds beyond which they may push users toward an Enterprise contract.

Is this pricing incorrect? — we'll verify and update it.