NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

Fiabilité 20%

nvidia/llama-3.1-nemotron-ultra-253b-v1

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural Architecture Search...

🧠 Intelligence & Données
Knowledge Cutoff: 2024-03-31
Tokenizer: Llama3
Moderation: ✅ Non
📅 Cycle de vie
Ajouté le: 08/04/2025
Spécifications
  • Provider & Modalité nvidia text->text
  • Fenêtre de contexte 131,072 tokens
  • Max Output Tokens 0
  • Support des Outils (Tools) Non supporté
🔍 Modèles similaires
Modèle Provider Input Output Contexte
NVIDIA: Nemotron 3 Super (free)
nvidia/nemotron-3-super-1...
nvidia $0.0000 $0.0000 262,144
NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-1...
nvidia $0.1000 $0.5000 262,144
Inception: Mercury 2
inception/mercury-2
inception $0.2500 $0.7500 128,000
OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chat
openai $1.7500 $14.0000 128,000
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-i...
google $0.5000 $3.0000 65,536