Modal Pricing 2026
Complete pricing guide with plans, hidden costs, and cost analysis
Modal has a free plan. Paid plans start at $250/GPU/hour (Team) and go up to $250/GPU/hour.
Are you Modal? Claim this profile
Modal costs Free to $250 per GPU/hour as of May 2026, with 3 plans available including a free tier. Plans: Starter (free), and Team at $250/GPU/hour. Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.
Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.
- Free tier: Yes
Modal offers 3 pricing tiers: Starter, Team, Enterprise. A free plan is available. Paid plans include Team at $250/month. The Team plan is startups and growing teams needing higher concurrency, custom domains, and collaboration features.
Compared to other ai/gpu cloud compute software, Modal is positioned at the mid-market price point.
- 2 documented hidden costs beyond list price
How much does Modal cost?
Modal Pricing Overview
Modal has 3 pricing plans, including a free tier. Paid plans range from $0 to $250/GPU/hour. The Starter plan is free and is best for individual developers and small teams getting started with serverless gpu compute. The Team plan costs $250/GPU/hour, best for startups and growing teams needing higher concurrency, custom domains, and collaboration features. The Enterprise plan requires contacting sales for a custom quote and is designed for organizations prioritizing security, compliance, dedicated support, and high-volume gpu compute.
Modal and annual price increases of $2-$72/hr per GPU.
There are at least 2 documented hidden costs beyond Modal's list price, including implementation, training, and add-on fees.
This pricing was last verified in May 6, 2026 from 3 independent sources.
Modal is a serverless compute platform that lets Python developers run GPU and CPU workloads in the cloud with zero infrastructure management. You write Python functions decorated with @app.function, and Modal handles scaling, containers, and billing. You pay only for active compute time — no idle charges, no minimum commitments. Modal is popular for ML inference, fine-tuning, batch jobs, and GPU-accelerated Python scripts. GPU pricing starts at $1.10/hr for A10G and goes up to $4.29/hr for H100 SXM.
How Modal Pricing Compares
Compare Modal pricing against top alternatives in AI/GPU Cloud Compute.
What does Modal actually cost you?
Drag the slider. Pick a tier. Watch your projected spend update live.
List price by tier (annualized, per seat)
Per-seat list price across Modal's plans, annualized. Custom-priced tiers show a hatched bar.
All Modal Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| Starter credits: $30/month compute creditsseats: Up to 3 | Free | Free | Individual developers and small teams getting started with serverless GPU compute |
| What's included at Starter Best for: Individual developers and small teams getting started with serverless GPU compute
Limits
| |||
| Team credits: $100/month compute creditsseats: Unlimited | $250 /month | Custom | Startups and growing teams needing higher concurrency, custom domains, and collaboration features |
| What's included at Team Best for: Startups and growing teams needing higher concurrency, custom domains, and collaboration features
Limits
| |||
| Enterprise | Contact Sales | Contact Sales | Organizations prioritizing security, compliance, dedicated support, and high-volume GPU compute |
| What's included at Enterprise Best for: Organizations prioritizing security, compliance, dedicated support, and high-volume GPU compute
| |||
View all features by plan (compare side-by-side)
Starter
- $30/month in free compute credits
- Up to 3 workspace seats
- 100 containers + 10 GPU concurrency
- 5 deployed crons, 8 deployed web endpoints
- 1 day log retention
- 200 deployed apps
- Real-time metrics and logs
- Region selection (1.25–2.5x base prices)
- SOC 2 compliance
- Community Slack support
Team
- $100/month in free compute credits
- Unlimited workspace seats
- 1000 containers + 50 GPU concurrency
- Unlimited crons and web endpoints
- 30 day log retention
- 1000 deployed apps
- Custom domains
- Static IP proxy
- Deployment rollbacks (custom versions)
- RBAC and SSO
- Community Slack support
Enterprise
- Volume-based compute discounts
- Unlimited workspace seats
- Custom GPU concurrency limits
- Custom container limits
- Embedded ML engineering services
- Support via private Slack
- Audit logs, Okta SSO, HIPAA compatibility
- Custom log retention
- AWS and GCP marketplace transacting
Usage-Based Rates
Per-unit pricing for Modal API usage.
Starter
| Model | Unit | Rate |
|---|---|---|
| Nvidia B200 | second | $0.00174 $6.25/hr — top-tier GPU |
| Nvidia H200 | second | $0.00126 $4.54/hr |
| Nvidia H100 (80GB) | second | $0.00110 $3.95/hr |
| Nvidia RTX PRO 6000 | second | $0.00084 $3.03/hr |
| Nvidia A100 80GB | second | $0.00069 $2.50/hr |
| Nvidia A100 40GB | second | $0.00058 $2.10/hr |
| Nvidia L40S (48GB) | second | $0.00054 $1.95/hr |
| Nvidia A10 (24GB) | second | $0.00031 $1.10/hr |
| Nvidia L4 | second | $0.00022 $0.80/hr |
| Nvidia T4 (16GB) | second | $0.00016 $0.59/hr — entry GPU |
| CPU (per physical core, 2 vCPU equiv) | second | $0.000013 $0.047/core-hr |
| Memory (per GiB) | second | $0.000002 $0.008/GiB-hr |
- Compute rates apply to all plans (Starter, Team, Enterprise)
- Billed per second of active execution — no idle charges
- Free credits automatically applied before billing
- Sandbox/Notebook CPU and memory priced higher (3x base)
- Non-preemptible execution available at 3x base prices
Compare Modal vs Alternatives
Before committing to Modal, compare pricing with these 3 alternatives in the same category.
What Companies Actually Pay for Modal
Modal Year 1 Total Cost by Company Size
Real deployment costs including licenses, implementation, training, and admin — not just the sticker price.
A solo developer using the free Starter plan for occasional on-demand GPU workloads. Compute is billed at ~$2/hr per GPU baseline with no monthly platform fee.
A small team on the Team plan deploying a 100B+ parameter model for product use. Large model serving can require multi-GPU setups at ~$72/hr, making this viable for shared usage across many users but prohibitively expensive for low-utilization individual deployments.
HN community (sid-the-kid, 2025-04-24)
How Modal Pricing Compares
| Software | Starting Price | Top Price |
|---|---|---|
| Modal | Free | $250/GPU/hour |
| CoreWeave | $10/instance/hour | $68.8/instance/hour |
| Hyperbolic | $0.3/GPU/hour | $3.2/GPU/hour |
| Lambda | $0.69/GPU/hour | $6.99/GPU/hour |
| Paperspace | Free | $39/GPU/hour |
| RunPod | $0.34/GPU/hour | $3.49/GPU/hour |
Detailed pricing comparisons:
Modal Contract Terms
Modal contracts do not auto-renew. Changes require advance notice. These terms are sourced from verified buyer experiences.
Modal Pricing FAQ
01 How much does Modal cost?
Modal charges per second of active compute. GPU pricing ranges from $1.10/hr for A10G/T4 to $4.29/hr for H100 SXM. CPU is $0.000306/vCPU-second. All new accounts get $30/month in free compute credits.
02 Does Modal have a free tier?
Yes — Modal's Starter plan includes $30/month in compute credits at no cost. These credits apply to any GPU or CPU workload, making it free for light personal use and experimentation.
03 How does Modal billing work?
Modal bills per millisecond of actual execution. There are no idle charges — you only pay when your function is actively running. This makes Modal very cost-effective for sporadic workloads compared to always-on servers.
04 What GPUs does Modal support?
Modal supports T4, A10G, L40S, A100 (40GB PCIe and 80GB SXM), H100 (PCIe and SXM), and H200. GPU availability varies; H100 and H200 require selection via the SDK.
05 Modal vs RunPod: which is cheaper?
For sporadic/bursty workloads, Modal is cheaper due to zero idle charges. For sustained heavy usage, RunPod's Secure Cloud or Community Cloud (spot at ~50% off) is cheaper per hour. Modal A10G costs $1.10/hr vs RunPod A10G at ~$0.69/hr on-demand — but Modal's billing precision eliminates wasted compute.
06 Is the Starter or Team plan fee the total cost, or are there additional compute charges?
The plan fee ($0 for Starter, $250/month for Team) is a platform access charge only. GPU compute is billed separately per-hour on top of the plan. Baseline GPU costs start around $2/hr per GPU, H100s run ~$4.5/hr, and serving large models can reach ~$72/hr. Actual monthly spend depends entirely on compute utilization.
07 Does Modal have cold start issues for production workloads?
Yes. Modal's serverless architecture spins containers on-demand rather than keeping GPUs always-on. This produces cold start latency when functions haven't been called recently. Users also need to configure their own inference stack (e.g., vLLM) rather than using a managed inference endpoint.
08 How does Modal GPU pricing compare to RunPod?
Based on community comparisons, Modal's H100 runs ~$4.5/hr versus RunPod's H100 at ~$2.5/hr. RunPod also offers lower-tier GPUs like RTX A5000 and RTX 3090 at ~$0.22/hr. Modal's higher pricing reflects its managed serverless infrastructure and simpler deployment model. Check modal.com/pricing directly for current GPU-specific rates.
09 Do I pay for container uptime or only execution time on Modal?
This is a common question. Modal's on-demand model is designed around paying for execution time rather than idle GPU time, which is its core value proposition versus always-on GPU providers. However, confirm the specific billing model for your workload type at modal.com/pricing, as container warm-up and minimum billing increments may apply.
Is this pricing incorrect? — we'll verify and update it.