Pricing

Code Generation
Cerebras Code
Experience instant code completions with frontier models.
- Daily token limits beginning at 24M
- Standard 131k context length
- Instant code generation at up to 2,000 tokens per second
- Community support via Discord
Monthly subscription starting at $50/month
General API Access
Free
The easiest way to get started with Cerebras.
- Access to all Cerebras powered models
- The world’s fastest inference – 20x faster than OpenAI and Anthropic
- Community support via Discord
general api access
Developer
Generous rate limits for power users
Everything in Free, plus:
- Self-serve payment starting at just $10
- 10x higher rate limits than free tier
- Higher priority processing
general api access
Enterprise
Highest throughput, custom weights, and guaranteed uptime.
Everything in Developer, plus:
- Highest rate limits for production workloads
- Lowest latency with dedicated queue priority
- Support for custom model weights
- Model fine-tuning and training services
- Dedicated support team with response time guarantees
Developer tier Pricing
*Preview models are intended for evaluation purposes only, and are not intended for use in production environments. They may be discontinued at short notice.
**Models have been scheduled for deprecation as part of ongoing efforts to serve the most up-to-date models.
Partners
Get access to Cerebras Inference through our partner APIs