ChatGPT’s Advanced Voice Feature Generating Buzz and Concern Among Early Users

ChatGPT’s Advanced Voice feature is generating buzz with its impressive capabilities, though some realistic outputs are causing unease.

Key Takeaways:

  • Select users have access to ChatGPT’s new Advanced Voice feature, which allows the AI to speak in various accents, dialects, and even imitate specific scenarios like an airline pilot’s announcement.
  • The AI’s adaptability to different voices and accents is remarkable, with examples showcasing its ability to speak dozens of languages and regional variations.
  • The feature is not yet perfect (it struggles, for example, to add appropriate sound effects), but its overall performance is impressive and could reduce the need for human interaction in certain contexts.

Showcasing Versatility:

  • Users have posted examples of ChatGPT speaking in various U.S. accents, including New York, Boston, and Wisconsin, demonstrating the AI’s ability to mimic regional variations of English.
  • The AI can even attempt to talk like animals, with one user asking it to sing Happy Birthday as a frog, dog, and cat, resulting in amusing and somewhat ridiculous-sounding outputs.
  • ChatGPT’s voice can be adjusted in terms of pitch, although it has limitations in how high or low it can go.

Realistic Outputs:

  • One example features ChatGPT simulating a conversation between Elon Musk and Donald Trump. The impersonations are described as “poor,” which may reassure those who would find overly realistic imitations concerning.
  • When asked to count quickly from 1 to 50, the AI sounded out of breath upon finishing, adding a layer of realism not typically heard in AI-generated speech.

Broader Implications:

ChatGPT’s Advanced Voice feature is impressive, but its ability to closely mimic human speech and behaviors raises questions about its impact on human interaction and communication. As the technology evolves, it will be important to weigh the ethical implications and establish guidelines for responsible development and deployment. The uneven quality of impersonations and the difficulty with sound effects also show that the feature needs further refinement before it can be widely adopted for more serious applications.

ChatGPT’s new voice mode is freaking people out — see it try accents and tell stories as an airline pilot
