CO/AI Subscribe
Monday · June 22, 2026 · Issue No. 904
Video

OpenAI GPT-4o Speech Models in 6 Minutes

Watch on YouTube

OpenAI Enhances Speech Models: New Text-to-Speech & Speech-to-Text Innovations In today’s video, we delve into OpenAI’s latest release of three new audio models. Discover the enhanced speech-to-text models superior to Whisper, and a groundbreaking text-to-speech model allowing precise control over timing and emotion. Learn how to try these models for free on OpenAI’s interface, designed with a distinctive, practical look by Teenage Engineering. Explore various voice types, personality settings, and pronunciation controls. We also compare new models, GPT-4 Transcribe and GPT-4 Mini Transcribe, against other state-of-the-art models. The video provides cost details and a simple guide to getting started with these models using Python, JavaScript, or cURL scripts in the OpenAI API. Additionally, insights into logging, tracing, and example setups in OpenAI Agents SDK are shared. Don’t miss out on the future of AI voice applications! Links: https://www.openai.fm/ https://www.youtube.com/watch?v=lXb0L16ISAc https://platform.openai.com/playground/tts https://platform.openai.com/docs/guides/audio https://platform.openai.com/docs/guides/speech-to-text https://platform.openai.com/docs/guides/text-to-speech https://platform.openai.com/docs/api-reference/introduction https://github.com/openai/openai-agents-python/tree/main/examples 00:00 Introduction to OpenAI’s New Audio Models 00:16 Exploring the Interface and Features 01:01 Demonstration of Text-to-Speech Capabilities 02:21 New Speech-to-Text Models and Their Performance 03:18 Getting Started with OpenAI’s API 04:21 Using OpenAI Agents SDK 05:15 Conclusion and Final Thoughts

Share: X LinkedIn Email
Video Feed

More videos

All videos →
Claude Fable 5: When Capability Meets Economics
Video

Claude Fable 5: When Capability Meets Economics

Anthropic released Cloud Fable 5 with a paradox built in: safeguards sophisticated enough to let a mythosclass model...

Run Agentic AI Entirely on Your Mac—No Cloud, No Latency, No Privacy Tradeoffs
Video

Run Agentic AI Entirely on Your Mac—No Cloud, No Latency, No Privacy Tradeoffs

Apple’s MLX framework is mature enough now that you can run serious agentic AI workflows locally on Silicon...

Hermes Agent Master Class
Video

Hermes Agent Master Class

Welcome to the Hermes Agent Master Class — an 11-episode series taking you from zero to fully leveraging...

CONSULTING

Outsider
Labs.

A management consulting team focused on AI transformations for executives and business owners.

Work with us →