DeepSeek: R1 Distill Llama 70B

Fiabilité 20%

deepseek/deepseek-r1-distill-llama-70b

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across multiple benchmarks...

🧠 Intelligence & Données
Knowledge Cutoff: 2024-07-31
Tokenizer: Llama3
Moderation: ✅ Non
📅 Cycle de vie
Ajouté le: 23/01/2025
Spécifications
  • Provider & Modalité deepseek text->text
  • Fenêtre de contexte 131,072 tokens
  • Max Output Tokens 16,384
  • Support des Outils (Tools) Non supporté
🔍 Modèles similaires
Modèle Provider Input Output Contexte
Inception: Mercury 2
inception/mercury-2
inception $0.2500 $0.7500 128,000
OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chat
openai $1.7500 $14.0000 128,000
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-i...
google $0.5000 $3.0000 65,536
AionLabs: Aion-2.0
aion-labs/aion-2.0
aion-labs $0.8000 $1.6000 131,072
MiniMax: MiniMax M2.5 (free)
minimax/minimax-m2.5:free
minimax $0.0000 $0.0000 196,608