inception

Inception: Mercury

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed-optimized models such as GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed lets developers build responsive user experiences, including voice agents, search interfaces, and chatbots. Read more in the [blog post](https://www.inceptionlabs.ai/blog/introducing-mercury).

Input Cost: $0.25 per 1M tokens
Output Cost: $0.75 per 1M tokens
Context Window: 128,000 tokens
Developer ID: inception/mercury
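
To make the listed prices concrete, here is a minimal sketch of a per-request cost estimate. It assumes billing is strictly linear in token counts at the listed rates ($0.25 per 1M input tokens, $0.75 per 1M output tokens), with no caching discounts, minimums, or extra fees; the function name `estimate_cost_usd` is hypothetical and not part of any Inception API.

```python
# Prices copied from the listing, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.25
OUTPUT_USD_PER_MILLION = 0.75
TOKENS_PER_MILLION = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate the cost of one request in USD.

    Assumes simple linear pricing; real invoices may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / TOKENS_PER_MILLION * INPUT_USD_PER_MILLION
    output_cost = output_tokens / TOKENS_PER_MILLION * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # 2M input tokens and 500K output tokens: 0.50 + 0.375 USD
    print(f"${estimate_cost_usd(2_000_000, 500_000):.3f}")
```

Because output tokens cost three times as much as input tokens at these rates, long generations tend to dominate the bill for generation-heavy workloads.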

Related Models

inception
$0.25/1M

Inception: Mercury Coder

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough ...

📝 128,000 ctx
openai
$1.75/1M

OpenAI: GPT-5.3 Chat

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations s...

📝 128,000 ctx
google
$0.25/1M

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume ...

📝 1,048,576 ctx