Pricing & Benchmarks for
300+ AI Models
The professional kit for developers and enterprises to compare AI model costs, context windows, and performance benchmarks.
Start ComparingFeatured Models
View All →Anthropic: Claude Fable Latest
This model always redirects to the latest model in the Claude Fable family.
Anthropic: Claude Fable 5
Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work...
Nex AGI: Nex-N2-Pro (free)
Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters...
NVIDIA: Nemotron 3.5 Content Safety (free)
NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model fr...
NVIDIA: Nemotron 3 Ultra (free)
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...
NVIDIA: Nemotron 3 Ultra
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...
Qwen: Qwen3.7 Plus
Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and i...
MiniMax: MiniMax M3
MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and vid...
StepFun: Step 3.7 Flash
Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It...
Anthropic: Claude Opus 4.8 (Fast)
Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with ...
Anthropic: Claude Opus 4.8
Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. ...
Qwen: Qwen3.7 Max
Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and ...
Browse by Provider
Frequently Asked Questions
On this page:
What is "Writer: Palmyra X5" optimized for?
Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agents across the enterprise. It delivers industry-leading speed and efficiency on context windows up to 1 million tokens.
What are the input and output modalities of "OpenAI: GPT Audio"?
"OpenAI: GPT Audio" can process both text and audio as input, and can generate both text and audio as output.
What is the context length of "MiniMax: MiniMax M2-her"?
"MiniMax: MiniMax M2-her" has a context length of 32,768 tokens, making it suitable for multi-turn conversations.
Is there a free model for on-device AI?
Yes, "LiquidAI: LFM2.5-1.2B-Instruct (free)" is a compact, high-performance instruction-tuned model built for fast on-device AI and is free to use.