DeepSeek: R1 Distill Llama 70B
Fiabilité
20%
deepseek/deepseek-r1-distill-llama-70b
DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across multiple benchmarks...
🧠 Intelligence & Données
Knowledge Cutoff:
2024-07-31
Tokenizer:
Llama3
Moderation:
✅ Non
📅 Cycle de vie
Ajouté le:
23/01/2025
Spécifications
-
Provider & Modalité deepseek text->text
-
Fenêtre de contexte 131,072 tokens
-
Max Output Tokens 16,384
-
Support des Outils (Tools) Non supporté
🔍 Modèles similaires
| Modèle | Provider | Input | Output | Contexte | |
|---|---|---|---|---|---|
|
Inception: Mercury 2
inception/mercury-2
|
inception | $0.2500 | $0.7500 | 128,000 | → |
|
OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chat
|
openai | $1.7500 | $14.0000 | 128,000 | → |
|
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-i...
|
$0.5000 | $3.0000 | 65,536 | → | |
|
AionLabs: Aion-2.0
aion-labs/aion-2.0
|
aion-labs | $0.8000 | $1.6000 | 131,072 | → |
|
MiniMax: MiniMax M2.5 (free)
minimax/minimax-m2.5:free
|
minimax | $0.0000 | $0.0000 | 196,608 | → |