×
How to try ChatGPT’s new Advanced Voice with Vision feature
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

The recent integration of voice commands and visual analysis capabilities into ChatGPT marks a significant advancement in making artificial intelligence more accessible and interactive for everyday users.

New Feature Overview: OpenAI has unveiled Advanced Voice with Vision as part of their ‘12 Days of OpenAI‘ demonstration, combining voice interaction and image analysis capabilities within ChatGPT.

  • The feature enables users to interact with ChatGPT through spoken commands while also analyzing uploaded images and video content
  • This enhancement is exclusively available to ChatGPT Plus and Pro subscribers, who pay $20 monthly for access
  • A special ‘Chat with Santa‘ feature has been made available to all users, including those on the free tier

Technical Implementation: The new functionality is designed to be intuitive and easily accessible through ChatGPT’s user interface.

  • Users can activate voice input through a microphone icon in the interface
  • Visual analysis is enabled via a camera/image upload icon
  • The Santa chat feature is denoted by a snowflake icon in the interface
  • The rollout is being implemented gradually across global markets to ensure system stability

Practical Applications: The combination of voice and visual capabilities opens up new use cases for everyday tasks and professional workflows.

  • Users can photograph their pantry contents and receive recipe suggestions based on available ingredients
  • The system can analyze and provide feedback on handwritten notes and documents
  • Professional users can leverage verbal and visual inputs for creating presentations, editing photos, and developing narratives

Feature Accessibility: The strategic rollout demonstrates OpenAI’s tiered approach to feature distribution.

  • Premium subscribers gain immediate access to the full suite of new capabilities
  • Free tier users maintain access to basic features and special releases like the seasonal Santa chat
  • The global rollout strategy suggests a careful approach to scaling these resource-intensive features

Looking Forward: While this integration represents a significant step in making AI more intuitive, the premium pricing structure may limit widespread adoption among casual users, potentially creating a divide between those who can access these advanced features and those who cannot.

ChatGPT Advanced Voice with Vision just launched — here’s how to try it

Recent News

Machine learning “periodic table” accelerates AI discovery

MIT's systematic classification reveals mathematical connections across AI methods, enabling researchers to identify gaps and develop new algorithms that outperform existing approaches.

Oregon lawmakers crack down on AI-generated fake nudes

Oregon joins other states in expanding "revenge porn" laws to criminalize AI-generated fake explicit imagery, after law enforcement faced cases where clothed social media photos were manipulated into realistic-looking nudes without legal recourse.

Docker simplifies AI model deployment with new container workflow

Docker's new tools standardize AI model deployment by extending container principles to machine learning workflows, enabling developers to manage AI components with the same consistency and security as traditional applications.