Grab any model ID from the tables below and pass it as the model parameter. Format: provider/model-name.
# Assumes `client` is an already-configured, OpenAI-compatible client (setup not shown here).
response = client.chat.completions.create(
    model="openai/gpt-4o",  # any model ID from the tables below
    messages=[{"role": "user", "content": "Hello!"}]
)
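Since every ID follows the provider/model-name format, it can be handy to check an ID before sending a request. The sketch below is only an illustration: `split_model_id` is a hypothetical helper, not part of any SDK, and it assumes the provider is everything before the first slash. Any suffix such as `:thinking` stays with the model name.

```python
def split_model_id(model_id: str) -> tuple[str, str]:
    """Split 'provider/model-name' into (provider, model_name).

    Hypothetical helper: assumes the provider is the text before the first '/'.
    """
    provider, sep, model_name = model_id.partition("/")
    if not sep or not provider or not model_name:
        raise ValueError(f"expected 'provider/model-name', got {model_id!r}")
    return provider, model_name


provider, model_name = split_model_id("openai/gpt-4o")
print(provider, model_name)
```

A malformed ID (for example, one with no slash) raises `ValueError` locally instead of failing at request time.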

OpenAI

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| openai/gpt-5.4 | GPT-5.4 | $2.50 | $15.00 | 1,050,000 |
| openai/gpt-5.4-pro | GPT-5.4 Pro | $30.00 | $180.00 | 1,050,000 |
| openai/gpt-5.2 | GPT-5.2 | $1.75 | $14.00 | 400,000 |
| openai/gpt-5.2-pro | GPT-5.2 Pro | $21.00 | $168.00 | 400,000 |
| openai/gpt-5.2-codex | GPT-5.2 Codex | $1.75 | $14.00 | 400,000 |
| openai/gpt-5.1 | GPT-5.1 | $1.25 | $10.00 | 400,000 |
| openai/gpt-5.1-codex | GPT-5.1 Codex | $1.25 | $10.00 | 400,000 |
| openai/gpt-5.1-codex-max | GPT-5.1 Codex Max | $1.25 | $10.00 | 400,000 |
| openai/gpt-5.1-codex-mini | GPT-5.1 Codex Mini | $0.25 | $2.00 | 400,000 |
| openai/gpt-5 | GPT-5 | $1.25 | $10.00 | 400,000 |
| openai/gpt-5-pro | GPT-5 Pro | $15.00 | $120.00 | 400,000 |
| openai/gpt-5-codex | GPT-5 Codex | $1.25 | $10.00 | 400,000 |
| openai/gpt-5-mini | GPT-5 Mini | $0.25 | $2.00 | 400,000 |
| openai/gpt-5-nano | GPT-5 Nano | $0.05 | $0.40 | 400,000 |
| openai/gpt-5-image | GPT-5 Image | $10.00 | $10.00 | 400,000 |
| openai/gpt-5-image-mini | GPT-5 Image Mini | $2.50 | $2.00 | 400,000 |
| openai/gpt-4.1 | GPT-4.1 | $2.00 | $8.00 | 1,047,576 |
| openai/gpt-4.1-mini | GPT-4.1 Mini | $0.40 | $1.60 | 1,047,576 |
| openai/gpt-4.1-nano | GPT-4.1 Nano | $0.10 | $0.40 | 1,047,576 |
| openai/gpt-4o | GPT-4o | $2.50 | $10.00 | 128,000 |
| openai/gpt-4o-mini | GPT-4o Mini | $0.15 | $0.60 | 128,000 |
| openai/gpt-4o-search-preview | GPT-4o Search Preview | $2.50 | $10.00 | 128,000 |
| openai/gpt-4o-mini-search-preview | GPT-4o Mini Search Preview | $0.15 | $0.60 | 128,000 |
| openai/gpt-4o-audio-preview | GPT-4o Audio | $2.50 | $10.00 | 128,000 |
| openai/gpt-4-turbo | GPT-4 Turbo | $10.00 | $30.00 | 128,000 |
| openai/gpt-4 | GPT-4 | $30.00 | $60.00 | 8,191 |
| openai/gpt-3.5-turbo | GPT-3.5 Turbo | $0.50 | $1.50 | 16,385 |
| openai/o4-mini | o4 Mini | $1.10 | $4.40 | 200,000 |
| openai/o4-mini-high | o4 Mini High | $1.10 | $4.40 | 200,000 |
| openai/o3 | o3 | $2.00 | $8.00 | 200,000 |
| openai/o3-pro | o3 Pro | $20.00 | $80.00 | 200,000 |
| openai/o3-mini | o3 Mini | $1.10 | $4.40 | 200,000 |
| openai/o3-deep-research | o3 Deep Research | $10.00 | $40.00 | 200,000 |
| openai/o1 | o1 | $15.00 | $60.00 | 200,000 |
| openai/o1-pro | o1 Pro | $150.00 | $600.00 | 200,000 |

Anthropic

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| anthropic/claude-opus-4.6 | Claude Opus 4.6 | $5.00 | $25.00 | 1,000,000 |
| anthropic/claude-opus-4.5 | Claude Opus 4.5 | $5.00 | $25.00 | 200,000 |
| anthropic/claude-opus-4.1 | Claude Opus 4.1 | $15.00 | $75.00 | 200,000 |
| anthropic/claude-opus-4 | Claude Opus 4 | $15.00 | $75.00 | 200,000 |
| anthropic/claude-sonnet-4.6 | Claude Sonnet 4.6 | $3.00 | $15.00 | 1,000,000 |
| anthropic/claude-sonnet-4.5 | Claude Sonnet 4.5 | $3.00 | $15.00 | 1,000,000 |
| anthropic/claude-sonnet-4 | Claude Sonnet 4 | $3.00 | $15.00 | 200,000 |
| anthropic/claude-3.7-sonnet | Claude 3.7 Sonnet | $3.00 | $15.00 | 200,000 |
| anthropic/claude-3.7-sonnet:thinking | Claude 3.7 Sonnet (thinking) | $3.00 | $15.00 | 200,000 |
| anthropic/claude-3.5-sonnet | Claude 3.5 Sonnet | $6.00 | $30.00 | 200,000 |
| anthropic/claude-haiku-4.5 | Claude Haiku 4.5 | $1.00 | $5.00 | 200,000 |
| anthropic/claude-3.5-haiku | Claude 3.5 Haiku | $0.80 | $4.00 | 200,000 |
| anthropic/claude-3-haiku | Claude 3 Haiku | $0.25 | $1.25 | 200,000 |

Google

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| google/gemini-3.1-pro-preview | Gemini 3.1 Pro Preview | $2.00 | $12.00 | 1,048,576 |
| google/gemini-3-pro-preview | Gemini 3 Pro Preview | $2.00 | $12.00 | 1,048,576 |
| google/gemini-3-flash-preview | Gemini 3 Flash Preview | $0.50 | $3.00 | 1,048,576 |
| google/gemini-2.5-pro | Gemini 2.5 Pro | $1.25 | $10.00 | 1,048,576 |
| google/gemini-2.5-pro-preview | Gemini 2.5 Pro Preview | $1.25 | $10.00 | 1,048,576 |
| google/gemini-2.5-flash | Gemini 2.5 Flash | $0.30 | $2.50 | 1,048,576 |
| google/gemini-2.5-flash-lite | Gemini 2.5 Flash Lite | $0.10 | $0.40 | 1,048,576 |
| google/gemini-2.0-flash-001 | Gemini 2.0 Flash | $0.10 | $0.40 | 1,048,576 |
| google/gemini-2.0-flash-lite-001 | Gemini 2.0 Flash Lite | $0.07 | $0.30 | 1,048,576 |
| google/gemma-3-27b-it | Gemma 3 27B | $0.04 | $0.15 | 128,000 |
| google/gemma-3-12b-it | Gemma 3 12B | $0.04 | $0.13 | 131,072 |
| google/gemma-3-4b-it | Gemma 3 4B | $0.04 | $0.08 | 131,072 |
| google/gemma-3n-e4b-it | Gemma 3n 4B | $0.02 | $0.04 | 32,768 |

Meta (Llama)

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| meta-llama/llama-4-maverick | Llama 4 Maverick | $0.15 | $0.60 | 1,048,576 |
| meta-llama/llama-4-scout | Llama 4 Scout | $0.08 | $0.30 | 327,680 |
| meta-llama/llama-3.3-70b-instruct | Llama 3.3 70B Instruct | $0.10 | $0.32 | 131,072 |
| meta-llama/llama-3.1-405b-instruct | Llama 3.1 405B Instruct | $4.00 | $4.00 | 131,000 |
| meta-llama/llama-3.1-70b-instruct | Llama 3.1 70B Instruct | $0.40 | $0.40 | 131,072 |
| meta-llama/llama-3.1-8b-instruct | Llama 3.1 8B Instruct | $0.02 | $0.05 | 16,384 |
| meta-llama/llama-3.2-11b-vision-instruct | Llama 3.2 11B Vision | $0.05 | $0.05 | 131,072 |
| meta-llama/llama-3.2-3b-instruct | Llama 3.2 3B Instruct | $0.05 | $0.34 | 80,000 |
| meta-llama/llama-3.2-1b-instruct | Llama 3.2 1B Instruct | $0.03 | $0.20 | 60,000 |

DeepSeek

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| deepseek/deepseek-v3.2 | DeepSeek V3.2 | $0.25 | $0.40 | 163,840 |
| deepseek/deepseek-chat-v3.1 | DeepSeek V3.1 | $0.15 | $0.75 | 32,768 |
| deepseek/deepseek-chat | DeepSeek V3 | $0.32 | $0.89 | 163,840 |
| deepseek/deepseek-r1 | DeepSeek R1 | $0.70 | $2.50 | 64,000 |
| deepseek/deepseek-r1-0528 | DeepSeek R1 0528 | $0.45 | $2.15 | 163,840 |
| deepseek/deepseek-r1-distill-llama-70b | R1 Distill Llama 70B | $0.70 | $0.80 | 131,072 |
| deepseek/deepseek-r1-distill-qwen-32b | R1 Distill Qwen 32B | $0.29 | $0.29 | 32,768 |

xAI (Grok)

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| x-ai/grok-4 | Grok 4 | $3.00 | $15.00 | 256,000 |
| x-ai/grok-4-fast | Grok 4 Fast | $0.20 | $0.50 | 2,000,000 |
| x-ai/grok-4.1-fast | Grok 4.1 Fast | $0.20 | $0.50 | 2,000,000 |
| x-ai/grok-3 | Grok 3 | $3.00 | $15.00 | 131,072 |
| x-ai/grok-3-mini | Grok 3 Mini | $0.30 | $0.50 | 131,072 |
| x-ai/grok-code-fast-1 | Grok Code Fast 1 | $0.20 | $1.50 | 256,000 |

Mistral

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| mistralai/mistral-large-2512 | Mistral Large 3 | $0.50 | $1.50 | 262,144 |
| mistralai/mistral-large | Mistral Large | $2.00 | $6.00 | 128,000 |
| mistralai/mistral-medium-3.1 | Mistral Medium 3.1 | $0.40 | $2.00 | 131,072 |
| mistralai/mistral-medium-3 | Mistral Medium 3 | $0.40 | $2.00 | 131,072 |
| mistralai/mistral-small-3.2-24b-instruct | Mistral Small 3.2 24B | $0.06 | $0.18 | 131,072 |
| mistralai/mistral-small-3.1-24b-instruct | Mistral Small 3.1 24B | $0.35 | $0.56 | 128,000 |
| mistralai/mistral-nemo | Mistral Nemo | $0.02 | $0.04 | 131,072 |
| mistralai/codestral-2508 | Codestral 2508 | $0.30 | $0.90 | 256,000 |
| mistralai/devstral-2512 | Devstral 2 | $0.40 | $2.00 | 262,144 |
| mistralai/devstral-medium | Devstral Medium | $0.40 | $2.00 | 131,072 |
| mistralai/devstral-small | Devstral Small 1.1 | $0.10 | $0.30 | 131,072 |
| mistralai/pixtral-large-2411 | Pixtral Large | $2.00 | $6.00 | 131,072 |
| mistralai/mistral-saba | Saba | $0.20 | $0.60 | 32,768 |

Qwen

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| qwen/qwen3-coder | Qwen3 Coder 480B A35B | $0.22 | $1.00 | 262,144 |
| qwen/qwen3-coder-flash | Qwen3 Coder Flash | $0.20 | $0.97 | 1,000,000 |
| qwen/qwen3-coder-plus | Qwen3 Coder Plus | $0.65 | $3.25 | 1,000,000 |
| qwen/qwen3-coder-next | Qwen3 Coder Next | $0.12 | $0.75 | 262,144 |
| qwen/qwen3-max | Qwen3 Max | $1.20 | $6.00 | 262,144 |
| qwen/qwen3-max-thinking | Qwen3 Max Thinking | $0.78 | $3.90 | 262,144 |
| qwen/qwen3-235b-a22b | Qwen3 235B A22B | $0.45 | $1.82 | 131,072 |
| qwen/qwen3-32b | Qwen3 32B | $0.08 | $0.24 | 40,960 |
| qwen/qwen3-14b | Qwen3 14B | $0.06 | $0.24 | 40,960 |
| qwen/qwen3-8b | Qwen3 8B | $0.05 | $0.40 | 40,960 |
| qwen/qwen3.5-397b-a17b | Qwen3.5 397B A17B | $0.39 | $2.34 | 262,144 |
| qwen/qwen3.5-122b-a10b | Qwen3.5 122B A10B | $0.26 | $2.08 | 262,144 |
| qwen/qwen3.5-27b | Qwen3.5 27B | $0.20 | $1.56 | 262,144 |
| qwen/qwen3.5-flash-02-23 | Qwen3.5 Flash | $0.10 | $0.40 | 1,000,000 |
| qwen/qwen3.5-plus-02-15 | Qwen3.5 Plus | $0.26 | $1.56 | 1,000,000 |
| qwen/qwen-max | Qwen Max | $1.04 | $4.16 | 32,768 |
| qwen/qwen-plus | Qwen Plus | $0.40 | $1.20 | 1,000,000 |
| qwen/qwen-turbo | Qwen Turbo | $0.03 | $0.13 | 131,072 |
| qwen/qwq-32b | QwQ 32B | $0.15 | $0.40 | 32,768 |
| qwen/qwen-2.5-72b-instruct | Qwen2.5 72B Instruct | $0.12 | $0.39 | 32,768 |
| qwen/qwen-2.5-coder-32b-instruct | Qwen2.5 Coder 32B | $0.20 | $0.20 | 32,768 |

Amazon

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| amazon/nova-2-lite-v1 | Nova 2 Lite | $0.30 | $2.50 | 1,000,000 |
| amazon/nova-premier-v1 | Nova Premier | $2.50 | $12.50 | 1,000,000 |
| amazon/nova-pro-v1 | Nova Pro | $0.80 | $3.20 | 300,000 |
| amazon/nova-lite-v1 | Nova Lite | $0.06 | $0.24 | 300,000 |
| amazon/nova-micro-v1 | Nova Micro | $0.04 | $0.14 | 128,000 |

Cohere

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| cohere/command-a | Command A | $2.50 | $10.00 | 256,000 |
| cohere/command-r-plus-08-2024 | Command R+ | $2.50 | $10.00 | 128,000 |
| cohere/command-r-08-2024 | Command R | $0.15 | $0.60 | 128,000 |
| cohere/command-r7b-12-2024 | Command R7B | $0.04 | $0.15 | 128,000 |

MoonshotAI (Kimi)

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| moonshotai/kimi-k2.5 | Kimi K2.5 | $0.45 | $2.20 | 262,144 |
| moonshotai/kimi-k2-0905 | Kimi K2 0905 | $0.40 | $2.00 | 131,072 |
| moonshotai/kimi-k2 | Kimi K2 | $0.55 | $2.20 | 131,000 |
| moonshotai/kimi-k2-thinking | Kimi K2 Thinking | $0.47 | $2.00 | 131,072 |

MiniMax

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| minimax/minimax-m2.5 | MiniMax M2.5 | $0.29 | $1.20 | 196,608 |
| minimax/minimax-m2.1 | MiniMax M2.1 | $0.27 | $0.95 | 196,608 |
| minimax/minimax-m2 | MiniMax M2 | $0.26 | $1.00 | 196,608 |
| minimax/minimax-m1 | MiniMax M1 | $0.40 | $2.20 | 1,000,000 |
| minimax/minimax-01 | MiniMax 01 | $0.20 | $1.10 | 1,000,192 |

NVIDIA

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| nvidia/llama-3.3-nemotron-super-49b-v1.5 | Nemotron Super 49B V1.5 | $0.10 | $0.40 | 131,072 |
| nvidia/llama-3.1-nemotron-70b-instruct | Nemotron 70B Instruct | $1.20 | $1.20 | 131,072 |
| nvidia/nemotron-3-nano-30b-a3b | Nemotron 3 Nano 30B | $0.05 | $0.20 | 262,144 |
| nvidia/nemotron-nano-12b-v2-vl | Nemotron Nano 12B VL | $0.20 | $0.60 | 131,072 |
| nvidia/nemotron-nano-9b-v2 | Nemotron Nano 9B | $0.04 | $0.16 | 131,072 |

Perplexity

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| perplexity/sonar-pro | Sonar Pro | $3.00 | $15.00 | 200,000 |
| perplexity/sonar-pro-search | Sonar Pro Search | $3.00 | $15.00 | 200,000 |
| perplexity/sonar-reasoning-pro | Sonar Reasoning Pro | $2.00 | $8.00 | 128,000 |
| perplexity/sonar-deep-research | Sonar Deep Research | $2.00 | $8.00 | 128,000 |
| perplexity/sonar | Sonar | $1.00 | $1.00 | 127,072 |

Z.ai (GLM)

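| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| z-ai/glm-5 | GLM 5 | $0.80 | $2.56 | 202,752 |
| z-ai/glm-4.7 | GLM 4.7 | $0.30 | $1.40 | 202,752 |
| z-ai/glm-4.7-flash | GLM 4.7 Flash | $0.06 | $0.40 | 202,752 |
| z-ai/glm-4.6 | GLM 4.6 | $0.39 | $1.90 | 204,800 |
| z-ai/glm-4.5 | GLM 4.5 | $0.60 | $2.20 | 131,072 |
| z-ai/glm-4.5-air | GLM 4.5 Air | $0.13 | $0.85 | 131,072 |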

ByteDance (Seed)

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| bytedance-seed/seed-1.6 | Seed 1.6 | $0.25 | $2.00 | 262,144 |
| bytedance-seed/seed-1.6-flash | Seed 1.6 Flash | $0.07 | $0.30 | 262,144 |
| bytedance-seed/seed-2.0-mini | Seed 2.0 Mini | $0.10 | $0.40 | 262,144 |

Baidu (ERNIE)

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| baidu/ernie-4.5-300b-a47b | ERNIE 4.5 300B | $0.28 | $1.10 | 123,000 |
| baidu/ernie-4.5-21b-a3b | ERNIE 4.5 21B | $0.07 | $0.28 | 120,000 |
| baidu/ernie-4.5-21b-a3b-thinking | ERNIE 4.5 21B Thinking | $0.07 | $0.28 | 131,072 |
| baidu/ernie-4.5-vl-424b-a47b | ERNIE 4.5 VL 424B | $0.42 | $1.25 | 123,000 |
| baidu/ernie-4.5-vl-28b-a3b | ERNIE 4.5 VL 28B | $0.14 | $0.56 | 30,000 |

Inception (Mercury)

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| inception/mercury-2 | Mercury 2 | $0.25 | $0.75 | 128,000 |
| inception/mercury | Mercury | $0.25 | $0.75 | 128,000 |
| inception/mercury-coder | Mercury Coder | $0.25 | $0.75 | 128,000 |

Other Providers

| Model ID | Name | Input | Output | Context |
| --- | --- | --- | --- | --- |
| ai21/jamba-large-1.7 | AI21 Jamba Large 1.7 | $2.00 | $8.00 | 256,000 |
| writer/palmyra-x5 | Writer Palmyra X5 | $0.60 | $6.00 | 1,040,000 |
| upstage/solar-pro-3 | Upstage Solar Pro 3 | $0.15 | $0.60 | 128,000 |
| inflection/inflection-3-productivity | Inflection 3 Productivity | $2.50 | $10.00 | 8,000 |
| microsoft/phi-4 | Microsoft Phi 4 | $0.06 | $0.14 | 16,384 |
| tencent/hunyuan-a13b-instruct | Tencent Hunyuan A13B | $0.14 | $0.57 | 131,072 |
| xiaomi/mimo-v2-flash | Xiaomi MiMo V2 Flash | $0.09 | $0.29 | 262,144 |
| stepfun/step-3.5-flash | StepFun Step 3.5 Flash | $0.10 | $0.30 | 256,000 |
| prime-intellect/intellect-3 | INTELLECT-3 | $0.20 | $1.10 | 131,072 |
| ibm-granite/granite-4.0-h-micro | IBM Granite 4.0 Micro | $0.02 | $0.11 | 131,000 |

Pricing is per million tokens. Prices may change, so check your dashboard for the latest rates.
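Because the Input and Output columns are priced per million tokens, a rough cost for a single request can be worked out from its token counts. The sketch below is an illustration only: `estimate_cost` is a hypothetical helper, and it assumes cost scales linearly with token counts at the listed rates, which may not match how your account is actually billed.

```python
TOKENS_PER_MILLION = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Estimate request cost in USD from per-million-token prices.

    Hypothetical helper: assumes simple linear pricing at the table rates.
    """
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_MILLION


# Example using the listed rates for openai/gpt-4o ($2.50 input, $10.00 output):
# 1,000 input tokens and 500 output tokens.
cost = estimate_cost(1_000, 500, input_price=2.50, output_price=10.00)
print(f"${cost:.4f}")  # prints "$0.0075"
```

Treat the result as an estimate and confirm actual charges in your dashboard.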
Last modified on March 6, 2026