tngtech

TNG: DeepSeek R1T2 Chimera

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671 B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AIโ€™s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20 % faster than the original R1 and more than 2ร— faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60 k tokens in standard use (tested to ~130 k) and maintains consistent <think> token behaviour, making it suitable for long-context analysis, dialogue and other open-ended generation tasks.

Input Cost
$0.25
per 1M tokens
Output Cost
$0.85
per 1M tokens
Context Window
163,840
tokens
Compare vs GPT-4o
Developer ID: tngtech/deepseek-r1t2-chimera

Related Models

bytedance-seed
$0.10/1M

ByteDance Seed: Seed-2.0-Mini

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, e...

๐Ÿ“ 262,144 ctx Compare →
google
$2.00/1M

Google: Gemini 3.1 Pro Preview Custom Tools

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool sele...

๐Ÿ“ 1,048,576 ctx Compare →
openai
$1.75/1M

OpenAI: GPT-5.3-Codex

GPT-5.3-Codex is OpenAIโ€™s most advanced agentic coding model, combining the frontier sof...

๐Ÿ“ 400,000 ctx Compare →