OpenAI: GPT Audio

Fiabilité 20%

openai/gpt-audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

🧠 Intelligence & Données
Knowledge Cutoff: Inconnue
Tokenizer: GPT
Moderation: ⚠️ Oui
📅 Cycle de vie
Ajouté le: 19/01/2026
Spécifications
  • Provider & Modalité openai text+audio->text+audio
  • Fenêtre de contexte 128,000 tokens
  • Max Output Tokens 16,384
  • Support des Outils (Tools) ✔️ Fonction Calling
🔍 Modèles similaires
Modèle Provider Input Output Contexte
OpenAI: GPT-5.4 Nano
openai/gpt-5.4-nano
openai $0.2000 $1.2500 400,000
OpenAI: GPT-5.4 Mini
openai/gpt-5.4-mini
openai $0.7500 $4.5000 400,000
OpenAI: GPT-5.4 Pro
openai/gpt-5.4-pro
openai $30.0000 $180.0000 1,050,000
OpenAI: GPT-5.4
openai/gpt-5.4
openai $2.5000 $15.0000 1,050,000
Inception: Mercury 2
inception/mercury-2
inception $0.2500 $0.7500 128,000