Choose the plan that's right for you
Developer
Powerful speed and reliability to start your project
Business
A plan that scales with your production usage
Enterprise
Personalized configurations for serving at scale
| Base model parameter count | $/1M tokens |
|---|---|
| up to 16B | $0.20 |
| 16.1B - 80B | $0.90 |
| Mixtral 8x7B | $0.50 |
Per-token pricing is applied only for non-enterprise deployments. Contact us for dedicated deployment pricing options.
| SDXL, $/step | SDXL w/ ControlNet, $/step |
|---|---|
| $0.0002 | $0.0003 |
For image generation models like SDXL we charge based on the number of inference steps (denoising iterations).
For multi-modal models like LLaVA, each image is billed as 576 prompt tokens.
| Base model parameter count | $/1M input tokens |
|---|---|
| up to 150M | $0.008 |
| 150M - 350M | $0.016 |
Embedding model pricing is based on the number of input tokens processed by the model.
| Model | $ / 1M tokens in training |
|---|---|
| Models up to 16B parameters | $0.50 |
| Models 16.1B - 80B | $3.00 |
| Mixtral 8x7B | $2.00 |
Fireworks charges based on the total number of tokens in your fine-tuning dataset (dataset size * number of epochs). A minimum charge of $3 is enforced (fine-tuning jobs that would have been charged less than $3 are rounded up to $3).