ByteDance: UI-TARS 7B

Reliability 20%

bytedance/ui-tars-1.5-7b

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Developed by ByteDance, it extends the UI-TARS framework with reinforcement learning-based reasoning, enabling robust action planning and exe...

🧠 Intelligence & Data
Knowledge Cutoff: 2025-01-31
Tokenizer: Other
Moderation: ✅ No
📅 Lifecycle
Added on: 2025-07-22
Specifications
  • Provider & Modality: bytedance, text+image->text
  • Context Window: 128,000 tokens
  • Max Output Tokens: 2,048
  • Tool Support: Not supported
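To make the two token limits above concrete, here is a minimal sketch of a prompt-budget check. It assumes that generated tokens count against the same 128,000-token context window as the prompt, which this listing does not state; the function names `max_prompt_tokens` and `fits` are hypothetical helpers, not part of any provider API.

```python
# Limits taken from the specification list above.
CONTEXT_WINDOW = 128_000      # total tokens the model can attend to
MAX_OUTPUT_TOKENS = 2_048     # upper bound on generated tokens


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return the largest prompt size that still leaves room for the reply.

    Assumption: prompt and output share one context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT_TOKENS:
        raise ValueError(
            f"reserved_output must be between 0 and {MAX_OUTPUT_TOKENS}"
        )
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """Check whether a prompt of the given token count fits the budget."""
    return 0 <= prompt_tokens <= max_prompt_tokens(reserved_output)


if __name__ == "__main__":
    print(max_prompt_tokens())   # 125952
    print(fits(120_000))         # True
    print(fits(127_000))         # False
```

Token counts depend on the model's tokenizer (listed above only as "Other"), so the count passed to `fits` would need to come from whatever tokenizer the serving endpoint actually uses.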
🔍 Similar Models
Model | ID | Provider | Input | Output | Context
Inception: Mercury 2 | inception/mercury-2 | inception | $0.2500 | $0.7500 | 128,000
OpenAI: GPT-5.3 Chat | openai/gpt-5.3-chat | openai | $1.7500 | $14.0000 | 128,000
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) | google/gemini-3.1-flash-i... | google | $0.5000 | $3.0000 | 65,536
AionLabs: Aion-2.0 | aion-labs/aion-2.0 | aion-labs | $0.8000 | $1.6000 | 131,072
Z.ai: GLM 5 | z-ai/glm-5 | z-ai | $0.7200 | $2.3000 | 80,000