Voice - CO/AI

Jan 31, 2025

ElevenLabs raises funding at $3.3B valuation for voice AI tech

Artificial intelligence voice cloning startup ElevenLabs has secured $180 million in Series C funding, reaching a $3.3 billion valuation. Funding details: The latest investment round was jointly led by Andreessen Horowitz and Iconiq Growth, with participation from several new and existing investors. NEA, World Innovation Lab, Valor, Endeavor Catalyst Fund, and Lunate joined as new investors. Existing investors, including Sequoia Capital, Salesforce Ventures, and Smash Capital, increased their investments. The company has now raised a total of $281 million since its founding in 2022. Company background: ElevenLabs, headquartered in London, specializes in developing advanced voice AI technology that can generate...

Jan 31, 2025

New Google AI feature aims to handle your phone calls

Google has launched an experimental AI feature called "Ask For Me" that makes phone calls to local businesses on behalf of users to gather information about prices and availability. What's new: Google's AI-powered calling feature is currently being tested with auto shops and nail salons as part of the company's Search Labs program. The technology builds on Google's existing Duplex system, which already handles restaurant reservations. The AI assistant explicitly identifies itself as automated when making calls. Businesses retain the option to opt out of receiving AI-generated calls. Key functionality and limitations: The service aims to reduce the number of direct...

Jan 16, 2025

Automakers are rolling out a new generation of in-car AI assistants, and this time they’ll be good

The automotive industry is rapidly deploying generative AI-powered voice assistants in vehicles, despite historical consumer skepticism about in-car voice technology. Current state of vehicle voice assistants: Traditional car voice assistants have frustrated drivers with frequent misunderstandings and interruptions, leading many consumers to abandon their use. Drivers report issues such as assistants activating during conversations and misinterpreting commands while driving. Current systems often require drivers to use specific command phrases rather than natural language. Multiple voice assistants in the same vehicle (Google Assistant, Alexa, Siri) create confusion about which commands work with which system. New AI developments: Major automakers are implementing...

Jan 14, 2025

The latest news on Amazon Alexa’s AI makeover

Latest Amazon AI Development: Amazon is working on upgrading Alexa with generative AI capabilities, though significant technical challenges remain before the virtual assistant can be relaunched as an AI Agent. Key challenges ahead: Amazon's AGI lead Rohit Prasad has identified several critical issues that need to be resolved before the enhanced version of Alexa can be released to the public. The primary concern is eliminating AI hallucinations, where the system generates false or fabricated information. Response speed and overall reliability remain significant hurdles in the development process. The technical team is focused on achieving near-zero hallucination rates before any public...

Jan 14, 2025

Samsung’s Sketch-to-Image feature will soon let you incorporate voice commands too

One of Samsung's next big reveals will reportedly include a new AI-powered Sketch to Image tool that allows users to combine voice and text inputs with drawings to generate images. Latest Features: Samsung is expanding the capabilities of its Galaxy AI-powered Sketch to Image tool within Drawing Assist, introducing multimodal input functionality that enhances creative possibilities. Users can now complement their sketches with typed text or voice commands to generate more detailed and specific images. The upgraded tool allows for intuitive combinations, such as sketching a cat and adding "spacesuit" via text or voice to create a composite image. Previously...

Jan 3, 2025

How to map ChatGPT voice mode to iPhone action button in 3 steps

OpenAI's ChatGPT Advanced Voice Mode can now be mapped to newer iPhone models' Action Button, providing easier access to voice-based AI interactions. Setup process: Mapping ChatGPT's Advanced Voice Mode to the Action Button involves several steps in the iPhone's settings. Navigate to Settings > Action Button. Scroll to the "Shortcut" option. Select ChatGPT from the third-party app list. Choose "Start voice conversation" with the headphones icon. Long-press the Action Button to activate the feature. Usage and accessibility options: The feature offers multiple activation methods to suit user preferences. The Action Button provides tactile activation without looking at the screen. Users...

Jan 3, 2025

Religious leaders are beginning to integrate AI into their sermons

The heart of the story: A Houston rabbi's use of an AI-generated sermon and voice clone represents a growing trend of religious leaders incorporating artificial intelligence into their spiritual practices. Key developments: Religious institutions are increasingly experimenting with AI tools to enhance and modernize religious services and practices. Rabbi Josh Fixler of Congregation Emanu El in Houston demonstrated an AI system dubbed "Rabbi Bot" that could both write and deliver sermons in his voice. The AI system, developed with help from a data scientist, was trained on the rabbi's previous sermons. During services, Rabbi Fixler engaged in real-time dialogue with...

Jan 3, 2025

Lucid EV drivers can now activate an AI assistant with voice commands

Lucid Motors announced its AI-powered in-vehicle assistant that responds to voice commands and provides real-time information to drivers. Core Feature Introduction: The Lucid Assistant, powered by SoundHound Chat AI, enables hands-free voice control and information access for Lucid EV drivers. Drivers can activate the assistant by saying "Hey Lucid" to access information about sports, stocks, weather, and vehicle features. The system includes access to the full vehicle manual for answering specific questions about vehicle functionality. Many features work without cellular connectivity, ensuring consistent availability. Technical Implementation: SoundHound's proprietary technology aims to deliver more accurate and reliable responses compared to traditional...

Dec 30, 2024

Emotional intelligence in AI will unlock human-computer interaction

The development of emotional intelligence in artificial intelligence systems represents a critical yet overlooked frontier in AI advancement, particularly in the context of voice technology and human-computer interaction. The current landscape: Voice AI technology, while advanced in many ways, still lacks fundamental emotional intelligence capabilities necessary for truly natural human-computer interaction. Current AI systems excel at processing information but struggle to interpret emotional context, dialect variations, and the nuances of human communication. Voice recognition technology often fails to accurately process speech from older individuals and those with diverse accents. Despite significant computational advances, AI systems remain limited in their ability...

Dec 22, 2024

Returned.com makes online returns a breeze with its new AI voice agent

Online shopping returns platform Returned.com has launched a new voice AI feature that eliminates customer service wait times by automatically handling return requests through its AI Return Agent, BoughtBot. Key innovation: The new voice AI capability integrates with Returned.com's existing AI Return Agent to manage complex return scenarios without requiring direct customer interaction with retailers. The system can bypass phone menus, communicate naturally with representatives, and negotiate return policies and exceptions. The technology operates seamlessly in the background while processing returns. The service currently supports over 131 major retailers. Technical implementation: The voice AI leverages natural language processing to ensure...

Dec 21, 2024

The next time you dial 911 you may be talking to an AI

Key Implementation Details: Kalamazoo County Consolidated Dispatch Authority has introduced an AI virtual assistant named Ava to manage incoming non-emergency calls. The system, developed by Seattle-based software company Aurelian, is specifically designed for emergency communications centers. Ava handles calls coming through the main non-emergency number (269-488-8911), which accounts for roughly 130,000 calls annually. The AI assistant is capable of communicating in multiple languages and can quickly process requests and respond to questions. Operational Benefits: The integration of AI technology aims to optimize dispatcher workload and improve response times for both emergency and non-emergency situations. Executive Director Jeffery Troyer emphasizes that...

Dec 20, 2024

Not sure what to say on your 1-800-ChatGPT phone call? Try these use cases

Service Overview and Accessibility: The new phone-based ChatGPT service provides hands-free AI assistance for a wide range of everyday tasks and queries through a traditional phone call. Users can access AI help by simply dialing 1-800-ChatGPT from any phone, making the technology available to those who may not have smartphones or internet access. The service processes natural voice conversations, eliminating the need to type queries or navigate apps. Calls can be made while driving or during other activities where using a screen would be impractical or unsafe. Practical Applications: Real-world testing reveals several valuable use cases that demonstrate the service's...

Dec 16, 2024

Vapi nets $20M fundraise to bring AI voice agents to enterprises

The rapidly evolving field of conversational AI has gained a significant new player with Vapi securing substantial funding to expand its voice agent technology for enterprise applications. Funding details and company background: Vapi, a Voice AI agent development platform founded in 2023, has secured $20 million in Series A funding led by Bessemer Venture Partners. The funding round included participation from Abstract Ventures, AI Grant, Y Combinator, Saga Ventures, and Michael Ovitz. Founded by Jordan Dearsley (CEO) and Nikhil Gupta (CTO), Vapi has scaled to millions in revenue within six months of launch. The investment will support engineering team expansion,...

Dec 16, 2024

OpenAI upgrades ChatGPT Search with voice queries and maps integration

OpenAI has expanded and enhanced ChatGPT's search capabilities with a suite of new features, making the tool more accessible and functional for users worldwide. Key Updates: OpenAI has introduced three significant changes to ChatGPT's search functionality, marking a substantial evolution from its initial SearchGPT preview phase. Search capabilities are now available to all users with free accounts, accessed through a globe icon in the compose bar. The platform now prioritizes displaying prominent webpage links before generating text-based responses. Advanced Voice Mode enables users to make voice queries and receive responses in their chosen AI voice from among 10 preset options...

Dec 15, 2024

Practical tasks you can do with ChatGPT’s Advanced Voice with Vision feature

OpenAI has expanded ChatGPT's capabilities with Advanced Voice with Vision, a new feature combining voice interaction and image processing capabilities for Plus and Pro subscribers. Launch details and availability: OpenAI unveiled Advanced Voice with Vision during their '12 Days of OpenAI' demonstration, marking a significant expansion of ChatGPT's interactive capabilities. The feature is exclusively available to ChatGPT Plus and Pro subscribers who pay the $20 monthly fee. A special 'Chat with Santa' feature will be accessible to all users, including those on the free tier. The rollout is happening gradually on a global scale. Core functionality: Advanced Voice with Vision...

Dec 15, 2024

How to try ChatGPT’s new Advanced Voice with Vision feature

The recent integration of voice commands and visual analysis capabilities into ChatGPT marks a significant advancement in making artificial intelligence more accessible and interactive for everyday users. New Feature Overview: OpenAI has unveiled Advanced Voice with Vision as part of their '12 Days of OpenAI' demonstration, combining voice interaction and image analysis capabilities within ChatGPT. The feature enables users to interact with ChatGPT through spoken commands while also analyzing uploaded images and video content. This enhancement is exclusively available to ChatGPT Plus and Pro subscribers, who pay $20 monthly for access. A special 'Chat with Santa' feature has been made...

Dec 11, 2024

How tools like NotebookLM use AI to generate podcasts

Google's NotebookLM transforms written documents into AI-generated audio summaries that mimic podcast-style presentations. Core functionality and purpose: NotebookLM serves as an AI-powered research assistant that helps users process and understand uploaded documents through interactive analysis and summarization. The platform allows users to upload documents or links and engage with AI to extract key information and insights. Its Audio Overviews feature converts text content into conversational audio summaries that sound similar to human-hosted podcasts. The tool is designed to act as a "thought partner" for researchers and writers working with complex information. Technical capabilities: The system employs advanced natural language processing...

Dec 9, 2024

Gemini is getting close to allowing for multilingual voice interactions

AI and technology giant Google is expanding the multilingual capabilities of its Gemini AI assistant with forthcoming support for bilingual conversations. Current status and functionality: Google's latest development enables users to engage with Gemini Live in two different languages during the same conversation, though the feature is not yet widely available. The functionality is specifically designed for speech-based interactions. Support documentation indicates compatibility with over 30 languages. Despite being documented in Google's support pages, the feature remains inaccessible to most users. Technical implementation: Recent analysis of Google's beta app reveals progress toward implementing this multilingual capability. Evidence of the feature...

Dec 9, 2024

AI voice startup by former OpenAI researcher raises $40M from A16Z

The big picture: Former OpenAI lead researcher Alexis Conneau's new venture WaveForms AI has secured $40 million in seed funding from Andreessen Horowitz, achieving a $200 million valuation. The San Francisco-based company aims to create AI audio software that can detect emotional nuances and facilitate more natural conversations between humans and machines. WaveForms launched on December 9, 2024, joining a growing wave of AI companies expanding beyond text-based interactions. The startup's focus on voice technology signals a shift toward more sophisticated human-computer interaction methods. Leadership background and timing: Conneau's transition from OpenAI to founding WaveForms demonstrates the rapid evolution and...

Dec 7, 2024

How to protect your family from AI voice clones claiming to be you

The rise of AI-powered voice cloning has prompted new security recommendations from law enforcement to protect against increasingly sophisticated scam attempts targeting families. Key development: The FBI has issued official guidance recommending families establish secret passwords to verify identity during suspicious calls, particularly those claiming to be emergency situations involving loved ones. The recommendation comes through an official public service announcement (I-120324-PSA) released on Tuesday. The FBI suggests creating unique, private phrases that family members can use to authenticate each other's identity. Voice verification has become necessary as criminals deploy AI technology to create convincing voice clones for fraudulent purposes...

Dec 6, 2024

How to use ElevenLabs’ AI agent Matilda to level up your math skills

Key innovation: ElevenLabs recently introduced Conversational AI agents, featuring Matilda, a specialized math tutor that combines natural language processing with educational expertise. Matilda engages in natural conversations about mathematics, allowing students to ask questions and receive detailed explanations about various math concepts. The AI tutor demonstrates particular effectiveness in explaining topics like similar shapes and graphing through interactive dialogue. Unlike traditional chatbots, Matilda can handle interruptions and maintain context throughout the conversation, creating a more authentic tutoring experience. Accessibility and pricing structure: ElevenLabs offers a flexible approach to AI tutoring that makes it accessible to a wide range of users....

Dec 4, 2024

ElevenLabs unveils lifelike conversational AI agents

Voice technology and artificial intelligence are converging to create more natural human-computer interactions, with ElevenLabs' new Conversational AI system leading this transformation. Product Overview: ElevenLabs has launched a voice bot system that simulates phone conversations with remarkable human-like qualities. The system allows users to customize voices through selection, design, or voice cloning capabilities. Users can integrate their own knowledge bases to create specialized AI assistants. The platform supports multiple language models from OpenAI, Google, and Anthropic, with options for custom model integration. Technical Architecture: The system employs a sophisticated multi-step process to deliver seamless voice interactions. Unlike direct speech-to-speech systems,...

Dec 4, 2024

ElevenLabs’ AI podcast generator offers 32 languages and dozens of voices

The emergence of AI-powered podcast generation tools is expanding the accessibility of audio content creation, with ElevenLabs Reader and Google's NotebookLM leading the way in transforming written content into conversational audio formats. Product Overview: ElevenLabs has launched a free iOS app called ElevenLabs Reader that converts various text formats into AI-generated podcasts using its GenFM feature. The app can process content from webpages, written text, PDFs, and scanned documents. It automatically selects appropriate AI voices for host and co-host roles. The platform supports 32 different languages, including English, Spanish, French, Italian, Japanese, Chinese, and Hindi. Technical Capabilities: The ElevenLabs ecosystem...

Dec 3, 2024

Argentine film to be the first with AI voice dubbing in original actors’ voices

The rise of AI-powered movie dubbing marks a significant shift in how foreign films may soon reach international audiences, with potential ripple effects across the entertainment industry. The groundbreaking development: Argentine horror-fantasy film "The Witch Game" is set to become the first theatrically released movie using AI-generated dubbing to recreate the original actors' voices in English. The film follows a young woman who receives a VR headset for her 18th birthday, transporting her to a mysterious magical school in another world. Director Fabian Forte's Harry Potter-inspired creation will bypass traditional dubbing methods, using AI to translate and synchronize dialogue while...
