Xiaomi: MiMo-V2-Omni

Reliability: 20%

xiaomi/mimo-v2-omni

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step planning, tool use, and code execution - making it well-suited for complex...

🧠 Intelligence & Data
Knowledge Cutoff: Unknown
Tokenizer: Other
Moderation: ✅ No
📅 Lifecycle
Added on: 2026-03-18
Specifications
  • Provider & Modality: xiaomi, text+image+audio+video->text
  • Context window: 262,144 tokens
  • Max output tokens: 65,536
  • Tool support: ✔️ Function Calling
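The context window and max output figures listed above imply a simple prompt budget. The sketch below works through that arithmetic. It assumes that prompt and completion tokens share the same 262,144-token window, which the listing does not state, and the function name `max_prompt_tokens` is hypothetical, introduced only for illustration.

```python
# Figures taken from the specification list.
CONTEXT_WINDOW = 262_144      # total tokens the model can attend to
MAX_OUTPUT_TOKENS = 65_536    # largest completion the model can return


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return how many prompt tokens fit after reserving room for the reply.

    ASSUMPTION: prompt and completion count against the same context
    window. The listing does not confirm this.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT_TOKENS:
        raise ValueError(
            f"reserved_output must be between 0 and {MAX_OUTPUT_TOKENS}"
        )
    return CONTEXT_WINDOW - reserved_output


if __name__ == "__main__":
    # Reserve the full output allowance, then a smaller 4,096-token reply.
    print(max_prompt_tokens())
    print(max_prompt_tokens(4_096))
```

Under that assumption, reserving the full output allowance leaves three quarters of the window for the prompt, so a smaller reservation is worth considering when the inputs are long video or audio.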
🔍 Similar models
Model                             Model ID                      Provider   Input    Output   Context
Google: Gemma 4 26B A4B           google/gemma-4-26b-a4b-it     google     $0.1300  $0.4000  262,144
Google: Gemma 4 31B               google/gemma-4-31b-it         google     $0.1400  $0.4000  262,144
Z.ai: GLM 5V Turbo                z-ai/glm-5v-turbo             z-ai       $1.2000  $4.0000  202,752
Arcee AI: Trinity Large Thinking  arcee-ai/trinity-large-th...  arcee-ai   $0.2200  $0.8500  262,144
Kwaipilot: KAT-Coder-Pro V2       kwaipilot/kat-coder-pro-v...  kwaipilot  $0.3000  $1.2000  256,000