NVIDIA: Llama 3.1 Nemotron Ultra 253B v1
Fiabilité
20%
nvidia/llama-3.1-nemotron-ultra-253b-v1
Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural Architecture Search...
🧠 Intelligence & Données
Knowledge Cutoff:
2024-03-31
Tokenizer:
Llama3
Moderation:
✅ Non
📅 Cycle de vie
Ajouté le:
08/04/2025
Spécifications
-
Provider & Modalité nvidia text->text
-
Fenêtre de contexte 131,072 tokens
-
Max Output Tokens 0
-
Support des Outils (Tools) Non supporté
🔍 Modèles similaires
| Modèle | Provider | Input | Output | Contexte | |
|---|---|---|---|---|---|
|
NVIDIA: Nemotron 3 Super (free)
nvidia/nemotron-3-super-1...
|
nvidia | $0.0000 | $0.0000 | 262,144 | → |
|
NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-1...
|
nvidia | $0.1000 | $0.5000 | 262,144 | → |
|
Inception: Mercury 2
inception/mercury-2
|
inception | $0.2500 | $0.7500 | 128,000 | → |
|
OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chat
|
openai | $1.7500 | $14.0000 | 128,000 | → |
|
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-i...
|
$0.5000 | $3.0000 | 65,536 | → |