Xiaomi: MiMo-V2-Omni
Reliability
20%
xiaomi/mimo-v2-omni
MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step planning, tool use, and code execution - making it well-suited for complex...
🧠 Intelligence & Data
Knowledge Cutoff:
Unknown
Tokenizer:
Other
Moderation:
✅ No
📅 Lifecycle
Added on:
18 March 2026
Specifications
- Provider & Modality: xiaomi, text+image+audio+video->text
- Context window: 262,144 tokens
- Max output tokens: 65,536
- Tool support: ✔️ Function calling
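The two limits above interact when sizing a request. The following is a minimal sketch of that arithmetic, assuming (the listing does not say so) that prompt and output tokens share the same 262,144-token context window. The function name `max_completion_budget` is hypothetical and not part of any provider API.

```python
# Limits copied from the specifications above.
CONTEXT_WINDOW = 262_144     # tokens
MAX_OUTPUT_TOKENS = 65_536   # tokens


def max_completion_budget(prompt_tokens: int, requested_output: int) -> int:
    """Return how many output tokens can be requested for a given prompt size.

    Assumption (not confirmed by the listing): prompt and output tokens
    are counted against the same context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        return 0  # the prompt alone already fills the window
    # The request is capped by both the output limit and the space left.
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)


if __name__ == "__main__":
    print(max_completion_budget(1_000, 100_000))
    print(max_completion_budget(250_000, 65_536))
```

Under this assumption, a short prompt is capped by the output limit, while a very long prompt is capped by whatever space remains in the window.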
🔍 Similar models
| Model | Provider | Input price | Output price | Context (tokens) |
|---|---|---|---|---|
| Google: Gemma 4 26B A4B (google/gemma-4-26b-a4b-it) | google | $0.1300 | $0.4000 | 262,144 |
| Google: Gemma 4 31B (google/gemma-4-31b-it) | google | $0.1400 | $0.4000 | 262,144 |
| Z.ai: GLM 5V Turbo (z-ai/glm-5v-turbo) | z-ai | $1.2000 | $4.0000 | 202,752 |
| Arcee AI: Trinity Large Thinking (arcee-ai/trinity-large-th...) | arcee-ai | $0.2200 | $0.8500 | 262,144 |
| Kwaipilot: KAT-Coder-Pro V2 (kwaipilot/kat-coder-pro-v...) | kwaipilot | $0.3000 | $1.2000 | 256,000 |
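To compare the listed prices for a concrete workload, a rough cost estimate can be sketched as below. The listing does not state the pricing unit; this sketch assumes USD per one million tokens, which should be verified against the provider's pricing page. Only models whose identifiers appear in full are included, and `estimate_cost` is a hypothetical helper name.

```python
# (input price, output price) as shown in the table above.
# ASSUMPTION: prices are USD per 1,000,000 tokens; the listing gives no unit.
PRICES = {
    "google/gemma-4-26b-a4b-it": (0.13, 0.40),
    "google/gemma-4-31b-it": (0.14, 0.40),
    "z-ai/glm-5v-turbo": (1.20, 4.00),
}

TOKENS_PER_PRICE_UNIT = 1_000_000  # assumed pricing unit


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD under the assumed unit."""
    input_price, output_price = PRICES[model_id]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Same hypothetical workload for every model: 200k input, 20k output tokens.
    for model_id in PRICES:
        print(model_id, round(estimate_cost(model_id, 200_000, 20_000), 4))
```

Keeping the pricing unit in a single constant makes it easy to correct if the assumption turns out to be wrong.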