OpenAI Launches Three Real-Time Voice Models for Developers
OpenAI has introduced three new real-time voice models for developers via its Realtime API. These models are designed to handle live reasoning, speech translation across more than 70 languages, and real-time transcription. The launch aims to enhance voice-based applications, enabling more natural and responsive interactions. Developers can now integrate these capabilities into their own software, leveraging OpenAI's advanced AI to process and generate spoken language in real time. This move expands OpenAI's suite of developer tools, following the earlier release of text and image generation models.
Key facts
- OpenAI launched three new real-time voice models.
- The models are available via the Realtime API.
- They handle live reasoning, speech translation across 70+ languages, and real-time transcription.
- The launch targets developers building voice-based applications.
- This expands OpenAI's developer tool offerings beyond text and image models.
Entities
Institutions
- OpenAI
Sources
- Quartz —