Skip to main content

Pricing

Code Generation

Cerebras Code

Experience instant code completions with frontier models.

  • Daily token limits beginning at 24M
  • Standard 131k context length
  • Instant code generation at up to 2,000 tokens per second
  • Community support via Discord

Monthly subscription starting at $50/month

Signup
General API Access

Free

The easiest way to get started with Cerebras.

  • Access to all Cerebras powered models​
  • The world’s fastest inference – 20x faster than OpenAI and Anthropic​
  • Community support via Discord
Get Api Key
general api access

Developer

Generous rate limits for power users​

Everything in Free, plus:​

  • Self-serve payment starting at just $10​
  • 10x higher rate limits than free tier​
  • Higher priority processing
Get API Key
general api access

Enterprise

Highest throughput, custom weights, and guaranteed uptime.​

Everything in Developer, plus:​

  • Highest rate limits for production workloads​
  • Lowest latency with dedicated queue priority​
  • Support for custom model weights​
  • Model fine-tuning and training services​
  • Dedicated support team with response time guarantees​
Contact Sales

Developer tier Pricing

*Preview models are intended for evaluation purposes only, and are not intended for use in production environments. They may be discontinued at short notice.

**Models have been scheduled for deprecation as part of ongoing efforts to serve the most up-to-date models.

Partners

Get access to Cerebras Inference through our partner APIs​