×
How to try ChatGPT’s new Advanced Voice with Vision feature
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

The recent integration of voice commands and visual analysis capabilities into ChatGPT marks a significant advancement in making artificial intelligence more accessible and interactive for everyday users.

New Feature Overview: OpenAI has unveiled Advanced Voice with Vision as part of their ‘12 Days of OpenAI‘ demonstration, combining voice interaction and image analysis capabilities within ChatGPT.

  • The feature enables users to interact with ChatGPT through spoken commands while also analyzing uploaded images and video content
  • This enhancement is exclusively available to ChatGPT Plus and Pro subscribers, who pay $20 monthly for access
  • A special ‘Chat with Santa‘ feature has been made available to all users, including those on the free tier

Technical Implementation: The new functionality is designed to be intuitive and easily accessible through ChatGPT’s user interface.

  • Users can activate voice input through a microphone icon in the interface
  • Visual analysis is enabled via a camera/image upload icon
  • The Santa chat feature is denoted by a snowflake icon in the interface
  • The rollout is being implemented gradually across global markets to ensure system stability

Practical Applications: The combination of voice and visual capabilities opens up new use cases for everyday tasks and professional workflows.

  • Users can photograph their pantry contents and receive recipe suggestions based on available ingredients
  • The system can analyze and provide feedback on handwritten notes and documents
  • Professional users can leverage verbal and visual inputs for creating presentations, editing photos, and developing narratives

Feature Accessibility: The strategic rollout demonstrates OpenAI’s tiered approach to feature distribution.

  • Premium subscribers gain immediate access to the full suite of new capabilities
  • Free tier users maintain access to basic features and special releases like the seasonal Santa chat
  • The global rollout strategy suggests a careful approach to scaling these resource-intensive features

Looking Forward: While this integration represents a significant step in making AI more intuitive, the premium pricing structure may limit widespread adoption among casual users, potentially creating a divide between those who can access these advanced features and those who cannot.

ChatGPT Advanced Voice with Vision just launched — here’s how to try it

Recent News

Veo 2 vs. Sora: A closer look at Google and OpenAI’s latest AI video tools

Tech companies unveil AI tools capable of generating realistic short videos from text prompts, though length and quality limitations persist as major hurdles.

7 essential ways to use ChatGPT’s new mobile search feature

OpenAI's mobile search upgrade enables business users to access current market data and news through conversational queries, marking a departure from traditional search methods.

FastVideo is an open-source framework that accelerates video diffusion models

New optimization techniques reduce the computing power needed for AI video generation from days to hours, though widespread adoption remains limited by hardware costs.