Pricing & Benchmarks for 300+ AI Models

The professional kit for developers and enterprises to compare AI model costs, context windows, and performance benchmarks.

Featured Models

openai
$1.75/1M

OpenAI: GPT-5.3 Chat

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations s...

Context window: 128,000 tokens
google
$0.25/1M

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume ...

Context window: 1,048,576 tokens
bytedance-seed
$0.10/1M

ByteDance Seed: Seed-2.0-Mini

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, e...

Context window: 262,144 tokens
google
$0.50/1M

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google's latest state of the ...

Context window: 65,536 tokens
qwen
$0.16/1M

Qwen: Qwen3.5-35B-A3B

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid archit...

Context window: 262,144 tokens
qwen
$0.20/1M

Qwen: Qwen3.5-27B

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechani...

Context window: 262,144 tokens
qwen
$0.26/1M

Qwen: Qwen3.5-122B-A10B

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that ...

Context window: 262,144 tokens
qwen
$0.10/1M

Qwen: Qwen3.5-Flash

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that in...

Context window: 1,000,000 tokens
liquid
$0.03/1M

LiquidAI: LFM2-24B-A2B

LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for ...

Context window: 32,768 tokens
google
$2.00/1M

Google: Gemini 3.1 Pro Preview Custom Tools

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool sele...

Context window: 1,048,576 tokens
openai
$1.75/1M

OpenAI: GPT-5.3-Codex

GPT-5.3-Codex is OpenAI's most advanced agentic coding model, combining the frontier sof...

Context window: 400,000 tokens
aion-labs
$0.80/1M

AionLabs: Aion-2.0

Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytellin...

Context window: 131,072 tokens
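
The per-1M figures on the cards above can be turned into a rough cost estimate for a given token volume. The sketch below is illustrative only: it assumes each listed figure is a flat USD price per 1 million tokens (the cards do not say whether it applies to input tokens, output tokens, or a blend), and the helper names `estimate_cost` and `rank_by_cost` are hypothetical, not part of any API.

```python
def estimate_cost(tokens: int, price_per_million_usd: float) -> float:
    """Estimated cost in USD for `tokens` tokens at a flat per-1M-token rate."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return tokens / 1_000_000 * price_per_million_usd


# Listed prices copied from three of the featured cards (USD per 1M tokens).
# Assumption: one flat rate per model; real billing may differ.
listed_prices = {
    "OpenAI: GPT-5.3 Chat": 1.75,
    "Google: Gemini 3.1 Flash Lite Preview": 0.25,
    "LiquidAI: LFM2-24B-A2B": 0.03,
}


def rank_by_cost(prices: dict[str, float], tokens: int) -> list[tuple[str, float]]:
    """Return (model, estimated cost) pairs sorted from cheapest to most expensive."""
    costs = [(name, estimate_cost(tokens, price)) for name, price in prices.items()]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Compare the three models for a hypothetical workload of 500,000 tokens.
    for name, cost in rank_by_cost(listed_prices, 500_000):
        print(f"{name}: ${cost:.4f}")
```

Under that flat-rate assumption the ranking depends only on the listed price, so the token count mainly matters for budgeting the absolute spend.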

Browse by Provider

Ai21: 1 model
Aion-labs: 4 models
Alfredpros: 1 model
Alibaba: 1 model
Allenai: 7 models
Alpindale: 1 model
Amazon: 5 models
Anthracite-org: 1 model
Anthropic: 13 models
Arcee-ai: 7 models
Baidu: 5 models
Bytedance: 1 model
Bytedance-seed: 3 models
Cognitivecomputations: 1 model
Cohere: 4 models
Deepcogito: 1 model
Deepseek: 12 models
Eleutherai: 1 model
Essentialai: 1 model
Google: 27 models
Gryphe: 1 model
Ibm-granite: 1 model
Inception: 2 models
Inflection: 2 models
Kwaipilot: 1 model
Liquid: 5 models
Mancer: 1 model
Meituan: 1 model
Meta-llama: 17 models
Microsoft: 2 models
Minimax: 6 models
Mistralai: 25 models
Moonshotai: 5 models
Morph: 2 models
Neversleep: 2 models
Nex-agi: 1 model
Nousresearch: 6 models
Nvidia: 8 models
Openai: 59 models
Openrouter: 3 models
Perplexity: 5 models
Prime-intellect: 1 model
Qwen: 50 models
Raifle: 1 model
Relace: 2 models
Sao10k: 5 models
Stepfun: 2 models
Switchpoint: 1 model
Tencent: 1 model
Thedrummer: 4 models
Tngtech: 1 model
Undi95: 1 model
Upstage: 1 model
Writer: 1 model
X-ai: 8 models
Xiaomi: 1 model
Z-ai: 11 models

Frequently Asked Questions

What is "Writer: Palmyra X5" optimized for?

Palmyra X5 is Writer's most advanced model, designed for building and scaling AI agents across the enterprise. It supports context windows of up to 1 million tokens and is positioned as fast and efficient at that scale.

What are the input and output modalities of "OpenAI: GPT Audio"?

"OpenAI: GPT Audio" can process both text and audio as input, and can generate both text and audio as output.

What is the context length of "MiniMax: MiniMax M2-her"?

"MiniMax: MiniMax M2-her" has a context length of 32,768 tokens, making it suitable for multi-turn conversations.

Is there a free model for on-device AI?

Yes, "LiquidAI: LFM2.5-1.2B-Instruct (free)" is a compact, high-performance instruction-tuned model built for fast on-device AI and is free to use.

Browse by Use Case

Coding: 95 models
Reasoning: 185 models
Vision: 76 models
Long Context: 250 models
