Long Context Models
Anthropic: Claude Fable Latest
This model always redirects to the latest model in the Claude Fable family.
Anthropic: Claude Fable 5
Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work...
Nex AGI: Nex-N2-Pro (free)
Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters...
NVIDIA: Nemotron 3.5 Content Safety (free)
NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model fr...
NVIDIA: Nemotron 3 Ultra (free)
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...
NVIDIA: Nemotron 3 Ultra
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...
Qwen: Qwen3.7 Plus
Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and i...
MiniMax: MiniMax M3
MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and vid...
StepFun: Step 3.7 Flash
Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It...
Anthropic: Claude Opus 4.8 (Fast)
Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with ...
Anthropic: Claude Opus 4.8
Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. ...
Qwen: Qwen3.7 Max
Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and ...
xAI: Grok Build 0.1
Grok Build 0.1 is xAIβs fast coding model trained specifically for agentic software engi...
Google: Gemini 3.5 Flash
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level cod...
Anthropic: Claude Opus 4.7 (Fast)
Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with ...
OpenRouter: Fusion
Fusion turns your prompt into a small multi-model deliberation. A panel of expert models (...
inclusionAI: Ring-2.6-1T
Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for r...
Google: Gemini 3.1 Flash Lite
Gemini 3.1 Flash Lite is Googleβs GA high-efficiency multimodal model optimized for low-...
OpenAI: GPT Chat Latest
GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to ...
xAI: Grok 4.3
Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output,...
IBM: Granite 4.1 8B
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part ...
Mistral: Mistral Medium 3.5
Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It support...
Owl Alpha
Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively ...
NVIDIA: Nemotron 3 Nano Omni (free)
NVIDIA Nemotronβ’ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as ...
Poolside: Laguna XS.2 (free)
Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://po...
Poolside: Laguna M.1 (free)
Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimi...
Anthropic Claude Haiku Latest
This model always redirects to the latest model in the Anthropic Claude Haiku family.
OpenAI GPT Mini Latest
This model always redirects to the latest model in the OpenAI GPT Mini family.
Google Gemini Pro Latest
This model always redirects to the latest model in the Google Gemini Pro family.
MoonshotAI Kimi Latest
This model always redirects to the latest model in the MoonshotAI Kimi family.
Google Gemini Flash Latest
This model always redirects to the latest model in the Google Gemini Flash family.
Anthropic Claude Sonnet Latest
This model always redirects to the latest model in the Anthropic Claude Sonnet family.
OpenAI GPT Latest
This model always redirects to the latest model in the OpenAI GPT family.
Qwen: Qwen3.5 Plus 2026-04-20
Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It acce...
Qwen: Qwen3.6 Flash
Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It suppo...
Qwen: Qwen3.6 35B A3B
Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion tota...
Qwen: Qwen3.6 Max Preview
Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse m...
Qwen: Qwen3.6 27B
Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, ...
OpenAI: GPT-5.5 Pro
GPT-5.5 Pro is OpenAIβs high-capability model optimized for deep reasoning and accuracy ...
OpenAI: GPT-5.5
GPT-5.5 is OpenAIβs frontier model designed for complex professional workloads, building...
DeepSeek: DeepSeek V4 Pro
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total pa...
DeepSeek: DeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 2...
inclusionAI: Ling-2.6-1T
Ling-2.6-1T is an instant (instruct) model from inclusionAI and the companyβs trillion-p...
Tencent: Hy3 preview
Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agenti...
Xiaomi: MiMo-V2.5-Pro
MiMo-V2.5-Pro is Xiaomiβs flagship model, delivering strong performance in general agent...
Xiaomi: MiMo-V2.5
MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance...
OpenAI: GPT-5.4 Image 2
[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model wi...
inclusionAI: Ling-2.6-flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters ...
Anthropic: Claude Opus Latest
This model always redirects to the latest model in the Claude Opus family.
Pareto Code Router
The Pareto Router maintains a tiered shortlist of strong coding models, ranked by [Artific...
MoonshotAI: Kimi K2.6 (free)
Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon cod...
MoonshotAI: Kimi K2.6
Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon cod...
Anthropic: Claude Opus 4.7
Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynch...
Anthropic: Claude Opus 4.6 (Fast)
Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with ...
Z.ai: GLM 5.1
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in...
Google: Gemma 4 26B A4B (free)
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google Deep...
Google: Gemma 4 26B A4B
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google Deep...
Google: Gemma 4 31B (free)
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...
Google: Gemma 4 31B
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...
Qwen: Qwen3.6 Plus
Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention wit...
Z.ai: GLM 5V Turbo
GLM-5V-Turbo is Z.aiβs first native multimodal agent foundation model, built for vision-...
Arcee AI: Trinity Large Thinking
Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI...
xAI: Grok 4.20 Multi-Agent
Grok 4.20 Multi-Agent is a variant of xAIβs Grok 4.20 designed for collaborative, agent-...
xAI: Grok 4.20
Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calli...
Google: Lyria 3 Pro Preview
Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music genera...
Google: Lyria 3 Clip Preview
30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music...
Kwaipilot: KAT-Coder-Pro V2
KAT-Coder-Pro V2 is the latest high-performance model in KwaiKATβs KAT-Coder series, des...
MiniMax: MiniMax M2.7
MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world...
OpenAI: GPT-5.4 Nano
GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, opt...
OpenAI: GPT-5.4 Mini
GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model opt...
Mistral: Mistral Small 4
Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabi...
Z.ai: GLM 5 Turbo
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in...
NVIDIA: Nemotron 3 Super (free)
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B par...
NVIDIA: Nemotron 3 Super
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B par...
ByteDance Seed: Seed-2.0-Lite
Seed-2.0-Lite is a versatile, costβefficient enterprise workhorse that delivers strong m...
Qwen: Qwen3.5-9B
Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver s...
OpenAI: GPT-5.4 Pro
GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture wi...
OpenAI: GPT-5.4
GPT-5.4 is OpenAIβs latest frontier model, unifying the Codex and GPT lines into a singl...
Inception: Mercury 2
Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM)...
OpenAI: GPT-5.3 Chat
GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations s...
Google: Gemini 3.1 Flash Lite Preview
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume ...
ByteDance Seed: Seed-2.0-Mini
Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, e...
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Googleβs latest state of the ...
Qwen: Qwen3.5-35B-A3B
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid archit...
Qwen: Qwen3.5-27B
The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechani...
Qwen: Qwen3.5-122B-A10B
The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that ...
Qwen: Qwen3.5-Flash
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that in...
LiquidAI: LFM2-24B-A2B
LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for ...
Google: Gemini 3.1 Pro Preview Custom Tools
Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool sele...
OpenAI: GPT-5.3-Codex
GPT-5.3-Codex is OpenAIβs most advanced agentic coding model, combining the frontier sof...
AionLabs: Aion-2.0
Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytellin...
Google: Gemini 3.1 Pro Preview
Gemini 3.1 Pro Preview is Googleβs frontier reasoning model, delivering enhanced softwar...
Anthropic: Claude Sonnet 4.6
Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance a...
Qwen: Qwen3.5 Plus 2026-02-15
The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture t...
Qwen: Qwen3.5 397B A17B
The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architectur...
MiniMax: MiniMax M2.5
MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained ...
Z.ai: GLM 5
GLM-5 is Z.aiβs flagship open-source foundation model engineered for complex systems des...
Qwen: Qwen3 Max Thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-...
Anthropic: Claude Opus 4.6
Opus 4.6 is Anthropicβs strongest model for coding and long-running professional tasks. ...
Qwen: Qwen3 Coder Next
Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and l...
Free Models Router
The simplest way to get free inference. openrouter/free is a router that selects free mode...
StepFun: Step 3.5 Flash
Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse M...
MoonshotAI: Kimi K2.5
Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual cod...
Upstage: Solar Pro 3
Solar Pro 3 is Upstage's powerful Mixture-of-Experts (MoE) language model. With 102B total...
Writer: Palmyra X5
Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agen...
OpenAI: GPT Audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot fe...
OpenAI: GPT Audio Mini
A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for m...
Z.ai: GLM 4.7 Flash
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and...
OpenAI: GPT-5.2-Codex
GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering a...
ByteDance Seed: Seed 1.6 Flash
Seed 1.6 Flash is an ultra-fast multimodal deep thinking model by ByteDance Seed, supporti...
ByteDance Seed: Seed 1.6
Seed 1.6 is a general-purpose model released by the ByteDance Seed team. It incorporates m...
MiniMax: MiniMax M2.1
MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding,...
Z.ai: GLM 4.7
GLM-4.7 is Z.aiβs latest flagship model, featuring upgrades in two key areas: enhanced p...
Google: Gemini 3 Flash Preview
Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic wor...
Xiaomi: MiMo-V2-Flash
MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mix...
NVIDIA: Nemotron 3 Nano 30B A3B (free)
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficien...
NVIDIA: Nemotron 3 Nano 30B A3B
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficien...
OpenAI: GPT-5.2 Chat
GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized fo...
OpenAI: GPT-5.2 Pro
GPT-5.2 Pro is OpenAIβs most advanced model, offering major improvements in agentic codi...
OpenAI: GPT-5.2
GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic ...
Mistral: Devstral 2 2512
Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic c...
Relace: Relace Search
The relace-search model uses 4-12 `view_file` and `grep` tools in parallel to explore a co...
Z.ai: GLM 4.6V
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and l...
Body Builder (beta)
Transform your natural language requests into structured OpenRouter API request objects. D...
OpenAI: GPT-5.1-Codex-Max
GPT-5.1-Codex-Max is OpenAIβs latest agentic coding model, designed for long-running, hi...
Amazon: Nova 2 Lite
Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can proc...
Mistral: Ministral 3 14B 2512
The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities ...
Mistral: Ministral 3 8B 2512
A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny l...
Mistral: Ministral 3 3B 2512
The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny...
Mistral: Mistral Large 3 2512
Mistral Large 3 2512 is Mistralβs most capable model to date, featuring a sparse mixture...
Arcee AI: Trinity Mini
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featu...
DeepSeek: DeepSeek V3.2
DeepSeek-V3.2 is a large language model designed to harmonize high computational efficienc...
Prime Intellect: INTELLECT-3
INTELLECT-3 is a 106B-parameter Mixture-of-Experts model (12B active) post-trained from GL...
Anthropic: Claude Opus 4.5
Claude Opus 4.5 is Anthropicβs frontier reasoning model optimized for complex software e...
Deep Cogito: Cogito v2.1 671B
Cogito v2.1 671B MoE represents one of the strongest open models globally, matching perfor...
OpenAI: GPT-5.1
GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-...
OpenAI: GPT-5.1 Chat
GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for...
OpenAI: GPT-5.1-Codex
GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and c...
OpenAI: GPT-5.1-Codex-Mini
GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex
MoonshotAI: Kimi K2 Thinking
Kimi K2 Thinking is Moonshot AIβs most advanced open reasoning model to date, extending ...
Amazon: Nova Premier 1.0
Amazon Nova Premier is the most capable of Amazonβs multimodal models for complex reason...
Perplexity: Sonar Pro Search
Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity...
OpenAI: gpt-oss-safeguard-20b
gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This...
NVIDIA: Nemotron Nano 12B 2 VL (free)
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...
MiniMax: MiniMax M2
MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end cod...
Qwen: Qwen3 VL 32B Instruct
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-...
IBM: Granite 4.0 Micro
Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models ar...
Microsoft: Phi 4 Mini Instruct
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered pub...
OpenAI: GPT-5 Image Mini
GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by [GPT-5 Mini]...
Anthropic: Claude Haiku 4.5
Claude Haiku 4.5 is Anthropicβs fastest and most efficient model, delivering near-fronti...
Qwen: Qwen3 VL 8B Thinking
Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal mode...
Qwen: Qwen3 VL 8B Instruct
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built...
OpenAI: GPT-5 Image
[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state...
OpenAI: o3 Deep Research
o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex,...
OpenAI: o4 Mini Deep Research
o4-mini-deep-research is OpenAI's faster, more affordable deep research modelβideal for ...
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model...
Qwen: Qwen3 VL 30B A3B Thinking
Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with v...
Qwen: Qwen3 VL 30B A3B Instruct
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with v...
OpenAI: GPT-5 Pro
GPT-5 Pro is OpenAIβs most advanced model, offering major improvements in reasoning, cod...
Z.ai: GLM 4.6
Compared with GLM-4.5, this generation brings several key improvements: Longer context win...
Anthropic: Claude Sonnet 4.5
Claude Sonnet 4.5 is Anthropicβs most advanced Sonnet model to date, optimized for real-...
DeepSeek: DeepSeek V3.2 Exp
DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an inter...
TheDrummer: Cydonia 24B V4.1
Uncensored and creative writing model based on Mistral Small 3.2 24B with good recall, pro...
Relace: Relace Apply 3
Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight ...
Google: Gemini 2.5 Flash Lite Preview 09-2025
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...
Qwen: Qwen3 VL 235B A22B Thinking
Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with...
Qwen: Qwen3 VL 235B A22B Instruct
Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text ge...
Qwen: Qwen3 Max
Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in ...
Qwen: Qwen3 Coder Plus
Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B...
OpenAI: GPT-5 Codex
GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and codin...
DeepSeek: DeepSeek V3.1 Terminus
DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that ...
Qwen: Qwen3 Coder Flash
Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 ...
Qwen: Qwen3 Next 80B A3B Thinking
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that ou...
Qwen: Qwen3 Next 80B A3B Instruct (free)
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...
Qwen: Qwen3 Next 80B A3B Instruct
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...
Qwen: Qwen Plus 0728 (thinking)
Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...
Qwen: Qwen Plus 0728
Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...
NVIDIA: Nemotron Nano 9B V2 (free)
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...
NVIDIA: Nemotron Nano 9B V2
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...
MoonshotAI: Kimi K2 0905
Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-...
Qwen: Qwen3 30B A3B Thinking 2507
Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimize...
Nous: Hermes 4 70B
Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. ...
Nous: Hermes 4 405B
Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nou...
DeepSeek: DeepSeek V3.1
DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that support...
Mistral: Mistral Medium 3.1
Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance ...
AI21: Jamba Large 1.7
Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in gro...
OpenAI: GPT-5 Chat
GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations ...
OpenAI: GPT-5
GPT-5 is OpenAIβs most advanced model, offering major improvements in reasoning, code qu...
OpenAI: GPT-5 Mini
GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning task...
OpenAI: GPT-5 Nano
GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for develope...
OpenAI: gpt-oss-120b (free)
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...
OpenAI: gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...
OpenAI: gpt-oss-20b (free)
gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ...
OpenAI: gpt-oss-20b
gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ...
Anthropic: Claude Opus 4.1
Claude Opus 4.1 is an updated version of Anthropicβs flagship model, offering improved p...
Mistral: Codestral 2508
Mistral's cutting-edge language model for coding released end of July 2025. Codestral spec...
Qwen: Qwen3 Coder 30B A3B Instruct
Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 ...
Qwen: Qwen3 30B A3B Instruct 2507
Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qw...
Z.ai: GLM 4.5
GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based application...
Z.ai: GLM 4.5 Air (free)
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-b...
Z.ai: GLM 4.5 Air
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-b...
Qwen: Qwen3 235B A22B Thinking 2507
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) ...
Z.ai: GLM 4 32B
GLM 4 32B is a cost-effective foundation language model. It can efficiently perform comple...
Qwen: Qwen3 Coder 480B A35B (free)
Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...
Qwen: Qwen3 Coder 480B A35B
Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...
ByteDance: UI-TARS 7B
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, in...
Google: Gemini 2.5 Flash Lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...
Qwen: Qwen3 235B A22B Instruct 2507
Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts lang...
Switchpoint Router
Switchpoint AI's router instantly analyzes your request and directs it to the optimal AI f...
MoonshotAI: Kimi K2 0711
Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moo...
Tencent: Hunyuan A13B Instruct
Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed b...
Morph: Morph V3 Large
Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accur...
Baidu: ERNIE 4.5 VL 424B A47B
ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baiduβs ERNIE...
Mistral: Mistral Small 3.2 24B
Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimiz...
MiniMax: MiniMax M1
MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and...
Google: Gemini 2.5 Flash
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for a...
Google: Gemini 2.5 Pro
Gemini 2.5 Pro is Googleβs state-of-the-art AI model designed for advanced reasoning, co...
OpenAI: o3 Pro
The o-series of models are trained with reinforcement learning to think before they answer...
Google: Gemini 2.5 Pro Preview 06-05
Gemini 2.5 Pro is Googleβs state-of-the-art AI model designed for advanced reasoning, co...
DeepSeek: R1 0528
May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par wi...
Anthropic: Claude Opus 4
Claude Opus 4 is benchmarked as the worldβs best coding model, at time of release, bring...
Anthropic: Claude Sonnet 4
Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, ex...
Mistral: Mistral Medium 3
Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver...
Google: Gemini 2.5 Pro Preview 05-06
Gemini 2.5 Pro is Googleβs state-of-the-art AI model designed for advanced reasoning, co...
Arcee AI: Virtuoso Large
VirtuosoβLarge is Arcee's topβtier generalβpurpose LLM at 72 B parameters, tuned to ...
Meta: Llama Guard 4 12B
Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for conte...
Qwen: Qwen3 30B A3B
Qwen3, the latest generation in the Qwen large language model series, features both dense ...
Qwen: Qwen3 8B
Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed f...
Qwen: Qwen3 14B
Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed...
Qwen: Qwen3 32B
Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimize...
Qwen: Qwen3 235B A22B
Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, acti...
OpenAI: o4 Mini High
OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort ...
OpenAI: o3
o3 is a well-rounded and powerful model across domains. It sets a new standard for math, s...
OpenAI: o4 Mini
OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-effi...
OpenAI: GPT-4.1
GPT-4.1 is a flagship large language model optimized for advanced instruction following, r...
OpenAI: GPT-4.1 Mini
GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substa...
OpenAI: GPT-4.1 Nano
For tasks that demand low latency, GPTβ4.1 nano is the fastest and cheapest model in the...
Meta: Llama 4 Maverick
Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Met...
Meta: Llama 4 Scout
Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by...
DeepSeek: DeepSeek V3 0324
DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the fl...
OpenAI: o1-pro
The o1 series of models are trained with reinforcement learning to think before they answe...
Mistral: Mistral Small 3.1 24B
Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring...
Google: Gemma 3 4B
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Google: Gemma 3 12B
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Cohere: Command A
Command A is an open-weights 111B parameter model with a 256k context window focused on de...
OpenAI: GPT-4o-mini Search Preview
GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It i...
OpenAI: GPT-4o Search Preview
GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trai...
Google: Gemma 3 27B
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Perplexity: Sonar Reasoning Pro
Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...
Perplexity: Sonar Pro
Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...
Perplexity: Sonar Deep Research
Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthes...
Llama Guard 3 8B
Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classifica...
OpenAI: o3 Mini High
OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort ...
AionLabs: Aion-1.0
Aion-1.0 is a multi-model system designed for high performance across various tasks, inclu...
AionLabs: Aion-1.0-Mini
Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designe...
Qwen: Qwen2.5 VL 72B Instruct
Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and i...
Qwen: Qwen-Plus
Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced ...
OpenAI: o3 Mini
OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, part...
DeepSeek: R1 Distill Qwen 32B
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](h...
DeepSeek: R1 Distill Llama 70B
DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-...
DeepSeek: R1
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and...
MiniMax: MiniMax-01
MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image u...
DeepSeek: DeepSeek V3
DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction foll...
Sao10K: Llama 3.3 Euryale 70B
Euryale L3.3 70B is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/s...
OpenAI: o1
The latest and strongest model family from OpenAI, o1 is designed to spend more time think...
Cohere: Command R7B (12-2024)
Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in Decemb...
Meta: Llama 3.3 70B Instruct (free)
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction...
Meta: Llama 3.3 70B Instruct
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction...
Amazon: Nova Lite 1.0
Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast ...
Amazon: Nova Micro 1.0
Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in t...
Amazon: Nova Pro 1.0
Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combi...
OpenAI: GPT-4o (2024-11-20)
The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more na...
Mistral Large 2407
This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a ...
Qwen2.5 Coder 32B Instruct
Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly k...
Anthropic: Claude 3.5 Haiku
Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool...
Qwen: Qwen2.5 7B Instruct
Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the followin...
Meta: Llama 3.2 3B Instruct (free)
Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...
Meta: Llama 3.2 3B Instruct
Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...
Meta: Llama 3.2 1B Instruct
Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing nat...
Meta: Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle ...
Qwen2.5 72B Instruct
Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the followi...
Cohere: Command R+ (08-2024)
command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) wit...
Cohere: Command R (08-2024)
command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved ...
Sao10K: Llama 3.1 Euryale 70B v2.2
Euryale L3.1 70B v2.2 is a model focused on creative roleplay from [Sao10k](https://ko-fi....
Nous: Hermes 3 70B Instruct
Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nou...
Nous: Hermes 3 405B Instruct (free)
Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...
Nous: Hermes 3 405B Instruct
Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...
OpenAI: GPT-4o (2024-08-06)
The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with t...
Meta: Llama 3.1 8B Instruct
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This ...
Meta: Llama 3.1 70B Instruct
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This ...
Mistral: Mistral Nemo
A 12B parameter model with a 128k token context length built by Mistral in collaboration w...
OpenAI: GPT-4o-mini (2024-07-18)
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting...
OpenAI: GPT-4o-mini
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting...
OpenAI: GPT-4o (2024-05-13)
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs...
OpenAI: GPT-4o
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs...
OpenAI: GPT-4 Turbo
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mo...
Anthropic: Claude 3 Haiku
Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsivene...
Mistral Large
This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's ...
OpenAI: GPT-4 Turbo Preview
The preview GPT-4 model with improved instruction following, JSON mode, reproducible outpu...
Auto Router
Your prompt will be processed by a meta-model and routed to one of dozens of models (see b...