Pricing & Benchmarks for
300+ AI Models

The professional kit for developers and enterprises to compare AI model costs, context windows, and performance benchmarks.

Start Comparing

Featured Models

View All →

openai

$1.75/1M

OpenAI: GPT-5.3 Chat

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations s...

📝 128,000 ctx Compare →

google

$0.25/1M

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume ...

📝 1,048,576 ctx Compare →

bytedance-seed

$0.10/1M

ByteDance Seed: Seed-2.0-Mini

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, e...

📝 262,144 ctx Compare →

google

$0.50/1M

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the ...

📝 65,536 ctx Compare →

qwen

$0.16/1M

Qwen: Qwen3.5-35B-A3B

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid archit...

📝 262,144 ctx Compare →

qwen

$0.20/1M

Qwen: Qwen3.5-27B

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechani...

📝 262,144 ctx Compare →

qwen

$0.26/1M

Qwen: Qwen3.5-122B-A10B

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that ...

📝 262,144 ctx Compare →

qwen

$0.10/1M

Qwen: Qwen3.5-Flash

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that in...

📝 1,000,000 ctx Compare →

liquid

$0.03/1M

LiquidAI: LFM2-24B-A2B

LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for ...

📝 32,768 ctx Compare →

google

$2.00/1M

Google: Gemini 3.1 Pro Preview Custom Tools

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool sele...

📝 1,048,576 ctx Compare →

openai

$1.75/1M

OpenAI: GPT-5.3-Codex

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier sof...

📝 400,000 ctx Compare →

aion-labs

$0.80/1M

AionLabs: Aion-2.0

Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytellin...

📝 131,072 ctx Compare →

Browse by Provider

📦 Ai21 1 Models 📦 Aion-labs 4 Models 📦 Alfredpros 1 Models 📦 Alibaba 1 Models 📦 Allenai 7 Models 📦 Alpindale 1 Models 📦 Amazon 5 Models 📦 Anthracite-org 1 Models 🛡️ Anthropic 13 Models 📦 Arcee-ai 7 Models 📦 Baidu 5 Models 📦 Bytedance 1 Models 📦 Bytedance-seed 3 Models 📦 Cognitivecomputations 1 Models 🏢 Cohere 4 Models 📦 Deepcogito 1 Models 📦 Deepseek 12 Models 📦 Eleutherai 1 Models 📦 Essentialai 1 Models 🔍 Google 27 Models 📦 Gryphe 1 Models 📦 Ibm-granite 1 Models 📦 Inception 2 Models 📦 Inflection 2 Models 📦 Kwaipilot 1 Models 📦 Liquid 5 Models 📦 Mancer 1 Models 📦 Meituan 1 Models 📦 Meta-llama 17 Models 📦 Microsoft 2 Models 📦 Minimax 6 Models 📦 Mistralai 25 Models 📦 Moonshotai 5 Models 📦 Morph 2 Models 📦 Neversleep 2 Models 📦 Nex-agi 1 Models 📦 Nousresearch 6 Models 📦 Nvidia 8 Models ⚡ Openai 59 Models 📦 Openrouter 3 Models 📦 Perplexity 5 Models 📦 Prime-intellect 1 Models 📦 Qwen 50 Models 📦 Raifle 1 Models 📦 Relace 2 Models 📦 Sao10k 5 Models 📦 Stepfun 2 Models 📦 Switchpoint 1 Models 📦 Tencent 1 Models 📦 Thedrummer 4 Models 📦 Tngtech 1 Models 📦 Undi95 1 Models 📦 Upstage 1 Models 📦 Writer 1 Models 📦 X-ai 8 Models 📦 Xiaomi 1 Models 📦 Z-ai 11 Models

Frequently Asked Questions

What is "Writer: Palmyra X5" optimized for?

Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agents across the enterprise. It delivers industry-leading speed and efficiency on context windows up to 1 million tokens.

What are the input and output modalities of "OpenAI: GPT Audio"?

"OpenAI: GPT Audio" can process both text and audio as input, and can generate both text and audio as output.

What is the context length of "MiniMax: MiniMax M2-her"?

"MiniMax: MiniMax M2-her" has a context length of 32,768 tokens, making it suitable for multi-turn conversations.

Is there a free model for on-device AI?

Yes, "LiquidAI: LFM2.5-1.2B-Instruct (free)" is a compact, high-performance instruction-tuned model built for fast on-device AI and is free to use.

Browse by Use Case

💻 Coding 95 Models 🧠 Reasoning 185 Models 👁️ Vision 76 Models 📚 Long Context 250 Models

Browse by Use Case

💻 Coding 95 Models 🧠 Reasoning 185 Models 👁️ Vision 76 Models 📚 Long Context 250 Models

Pricing & Benchmarks for300+ AI Models

Featured Models

OpenAI: GPT-5.3 Chat

Google: Gemini 3.1 Flash Lite Preview

ByteDance Seed: Seed-2.0-Mini

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Qwen: Qwen3.5-35B-A3B

Qwen: Qwen3.5-27B

Qwen: Qwen3.5-122B-A10B

Qwen: Qwen3.5-Flash

LiquidAI: LFM2-24B-A2B

Google: Gemini 3.1 Pro Preview Custom Tools

OpenAI: GPT-5.3-Codex

AionLabs: Aion-2.0

Browse by Provider

Frequently Asked Questions

On this page:

What is "Writer: Palmyra X5" optimized for?

What are the input and output modalities of "OpenAI: GPT Audio"?

What is the context length of "MiniMax: MiniMax M2-her"?

Is there a free model for on-device AI?

Browse by Use Case

Browse by Use Case

Pricing & Benchmarks for
300+ AI Models