The launch of Google’s Gemini Live brings advanced AI conversational capabilities to mobile users across both major platforms, marking a significant shift in how people can interact with AI assistants.
Major development: Google has made Gemini Live, its conversational AI assistant, freely available to both iOS and Android users, expanding access to natural language AI interactions.
- The assistant can engage in human-like conversations across 30 different languages, including Arabic, Chinese, French, German, and Spanish
- Users can interrupt conversations, have multi-turn dialogues, and return to previous discussions
- The feature was previously limited to Gemini Advanced subscribers who paid $20 monthly
Accessibility and setup: Setting up Gemini Live involves a straightforward process on both major mobile platforms.
- Android users can download the Gemini app from the Google Play Store and look for the waveform icon at the bottom left of the screen
- iOS users can access Gemini Live through the new Gemini app in the App Store, which quickly reached #17 in the Productivity category
- Both versions require users to sign in with a Google account and grant microphone permissions
Key capabilities: Gemini Live extends beyond traditional voice assistant functionality with enhanced conversational abilities.
- The AI can maintain context throughout conversations, enabling natural follow-up questions
- Users can engage in abstract discussions about feelings, life, and ideas
- The system can provide advice and maintain more nuanced conversations than conventional voice assistants
Competitive landscape: Google’s free offering stands out in a market where similar capabilities often come with a premium price tag.
- ChatGPT’s Voice Mode, a comparable feature, is only available to ChatGPT Plus subscribers at $20 monthly
- Microsoft’s Copilot Voice offers similar capabilities and is also available for free on both iOS and Android
- Future updates will include visual context capabilities, allowing the assistant to use phone cameras to inform responses
Future developments: The integration of visual capabilities promises to expand Gemini Live’s functionality, though implementation timing remains unclear.
- Both Google and OpenAI plan to incorporate camera functionality to provide environmental context
- This feature will allow the AI to better understand and respond to users’ surroundings
- No specific launch dates have been announced for these visual capabilities
Market implications: The widespread availability of sophisticated AI assistants at no cost could reshape user expectations for digital interactions.
Gemini Live is available to all iOS and Android users now - for free. How to try it