Fireworks-ai API Catalog

Comprehensive overview of all fireworks-ai models available through the LLM Kit

Overview

Provider
fireworks-ai
Total Models
20
Last Updated
2025-12-10

DeepSeek V3.1

Model ID accounts/fireworks/models/deepseek-v3p1
Family
deepseek-v3

Specifications

Context Window: 160,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5
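
Because prices are quoted per million tokens, the cost of a single request is each token count multiplied by its price and divided by 1,000,000. The sketch below applies that arithmetic to the DeepSeek V3.1 prices listed above. The helper `estimate_cost` and the constant names are hypothetical and are not part of the LLM Kit or the Fireworks API. Actual billing may differ from this estimate.

```python
from decimal import Decimal

# USD per million tokens, copied from the DeepSeek V3.1 entry above.
INPUT_PRICE_PER_MILLION = Decimal("0.5")
OUTPUT_PRICE_PER_MILLION = Decimal("1.5")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


# 100,000 input tokens and 20,000 output tokens (the listed output maximum).
cost = estimate_cost(input_tokens=100_000, output_tokens=20_000)
print(f"Estimated cost: ${cost}")
```

For this request the input side contributes $0.05 and the output side $0.03, so the estimate is $0.08. Decimal is used instead of float so that small per-token prices do not pick up binary rounding error.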

DeepSeek V3.1 Terminus

Model ID accounts/fireworks/models/deepseek-v3p1-terminus
Family
deepseek-v3

Specifications

Context Window: 160,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

DeepSeek V3.2

Model ID accounts/fireworks/models/deepseek-v3p2
Family
deepseek-v3

Specifications

Context Window: 160,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

FLUX.1 Kontext Max

Model ID accounts/fireworks/models/flux-kontext-max
Family
flux-kontext

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.08
Output
$0.08

FLUX.1 Kontext Pro

Model ID accounts/fireworks/models/flux-kontext-pro
Family
flux-kontext

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.04
Output
$0.04

FLUX.1 [dev] FP8

Model ID accounts/fireworks/models/flux-1-dev-fp8
Family
flux-1

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0005
Output
$0.0005

FLUX.1 [schnell] FP8

Model ID accounts/fireworks/models/flux-1-schnell-fp8
Family
flux-1

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.00035
Output
$0.00035

GLM-4.5V

Model ID accounts/fireworks/models/glm-4p5v
Family
glm-4p5v

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

GLM-4.6

Model ID accounts/fireworks/models/glm-4p6
Family
glm-4p5v

Specifications

Context Window: 198,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Kimi K2 Instruct 0905

Model ID accounts/fireworks/models/kimi-k2-instruct-0905
Family
kimi-k2

Specifications

Context Window: 256,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Kimi K2 Thinking

Model ID accounts/fireworks/models/kimi-k2-thinking
Family
kimi-k2

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Llama 4 Maverick Instruct (Basic)

Model ID accounts/fireworks/models/llama4-maverick-instruct-basic
Family
llama4-maverick

Specifications

Context Window: 1,000,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Fine tuning, Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

NVIDIA Nemotron Nano 2 VL

Model ID accounts/fireworks/models/nemotron-nano-v2-12b-vl
Family
nemotron-nano

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

OpenAI gpt-oss-120b

Model ID accounts/fireworks/models/gpt-oss-120b
Family
gpt-oss

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

OpenAI gpt-oss-20b

Model ID accounts/fireworks/models/gpt-oss-20b
Family
gpt-oss

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Qwen2.5-VL 32B Instruct

Model ID accounts/fireworks/models/qwen2p5-vl-32b-instruct
Family
qwen2p5-vl

Specifications

Context Window: 125,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Fine tuning, Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Streaming ASR v1

Model ID accounts/fireworks/models/fireworks-asr-large
Family
fireworks-asr-large

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0032
Output
$0.0032

Streaming ASR v2

Model ID accounts/fireworks/models/fireworks-asr-v2
Family
fireworks-asr-v2

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0035
Output
$0.0035

Whisper V3 Large

Model ID accounts/fireworks/models/whisper-v3
Family
whisper-v3

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0015
Output
$0.0015

Whisper V3 Turbo

Model ID accounts/fireworks/models/whisper-v3-turbo
Family
whisper-v3-turbo

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0009
Output
$0.0009
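
When choosing a model from this catalog, the two fields that usually narrow the choice first are the input modality and the context window. The sketch below shows one way to filter on them, using four entries copied from the listing above. The `ModelEntry` type, the `CATALOG` list, and `find_models` are illustrative names only and are assumptions, not an interface exposed by the LLM Kit.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One catalog row (hypothetical structure for illustration)."""
    model_id: str
    context_window: int  # tokens
    inputs: tuple
    outputs: tuple


# A small subset of the catalog, with values copied from the entries above.
CATALOG = [
    ModelEntry("accounts/fireworks/models/deepseek-v3p1", 160_000, ("text",), ("text",)),
    ModelEntry("accounts/fireworks/models/glm-4p5v", 128_000, ("text", "image"), ("text",)),
    ModelEntry("accounts/fireworks/models/llama4-maverick-instruct-basic", 1_000_000,
               ("text", "image"), ("text",)),
    ModelEntry("accounts/fireworks/models/whisper-v3", 128_000, ("audio",), ("text",)),
]


def find_models(catalog, input_modality, min_context=0):
    """Return IDs of models that accept input_modality and meet min_context."""
    return [
        entry.model_id
        for entry in catalog
        if input_modality in entry.inputs and entry.context_window >= min_context
    ]


# Image-capable models with at least a 200,000-token context window.
print(find_models(CATALOG, "image", min_context=200_000))
```

With this subset, asking for image input and a 200,000-token minimum leaves only the Llama 4 Maverick entry, since GLM-4.5V lists a 128,000-token window.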