deepseek

DeepSeek: R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is a distilled large language model built on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct) and fine-tuned on outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

Input Cost: $0.70 per 1M tokens

Output Cost: $0.80 per 1M tokens

Context Window: 131,072 tokens
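As a rough illustration of how the listed rates translate into a per-request cost, here is a minimal Python sketch. The function names `estimate_cost_usd` and `fits_context` are hypothetical helpers, not part of any provider SDK. The sketch also assumes that input and output tokens share the 131,072-token context window, which this listing does not state explicitly.

```python
# Rates and context size copied from the listing above.
INPUT_USD_PER_MILLION = 0.70
OUTPUT_USD_PER_MILLION = 0.80
CONTEXT_WINDOW_TOKENS = 131_072


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check the request against the context window.

    Assumption: prompt and completion tokens count toward the same window.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token completion.
    cost = estimate_cost_usd(100_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
    print("Fits in context:", fits_context(100_000, 2_000))
```

Actual billing may differ from this estimate, since providers count tokens with their own tokenizer and may round or apply minimum charges.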

Developer ID: deepseek/deepseek-r1-distill-llama-70b

Related Models

deepseek

$0.50/1M

DeepSeek: R1 0528

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1). Performance on par wi...

163,840 ctx

deepseek

$0.32/1M

DeepSeek: DeepSeek V3

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction foll...

163,840 ctx

deepseek

$0.20/1M

DeepSeek: DeepSeek V3 0324

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the fl...

163,840 ctx