Voice - CO/AI

News/Voice

Sep 25, 2024

ElevenLabs to Add Deepak Chopra Voice to Its Voice Library

AI-powered audio innovation: Deepak Chopra, renowned author and wellness expert, has partnered with AI firm ElevenLabs to add his voice to their roster of notable voices for audio streaming of written content. The collaboration stems from the "Digital Deepak" chatbot app launched in June, which uses AI to answer questions in Chopra's synthetic voice based on his extensive body of work. ElevenLabs has gained attention in the entertainment industry for partnerships with estates of celebrities like Judy Garland and James Dean, allowing users to hear books and articles read in iconic voices. This development coincides with OpenAI's introduction of new...

read Sep 24, 2024

The Next Time You Call a Restaurant You Might Be Talking to an AI

AI revolutionizing restaurant reservations: The hospitality industry is witnessing a significant shift as more restaurants turn to artificial intelligence to handle phone reservations, streamlining operations and addressing staffing challenges. Several startups, including Maitre-D AI, RestoHost, and Slang, have emerged in recent years, offering AI-powered reservation services to restaurants across major cities. These AI assistants are designed to alleviate the workload of restaurant staff, particularly in establishments facing chronic understaffing in the post-pandemic era. The technology aims to manage the high volume of calls received by popular restaurants, which can range from 800 to 1,000 calls per month in busy urban...

read Sep 23, 2024

Unauthorized AI Voice Cloning Sparks Controversy for YouTube Creator

Unauthorized voice cloning discovered: Jeff Geerling, a YouTube content creator, found that his voice had been used without permission in promotional videos for Elecrow, an electronics company. Geerling discovered the unauthorized use of his voice when a viewer alerted him to an Elecrow video that sounded like him but contained words he never said. The cloned voice was used in multiple tutorial series for Elecrow products, including those on ESP32 and RP2040. Geerling believes his YouTube videos were likely used to train an AI voice clone tool, which then generated the narration for Elecrow's videos. Ethical and legal concerns: The...

read Sep 20, 2024

AI Voice Calling Scams are on the Rise – Do You Have a Secret Phrase?

AI voice cloning scams gaining traction: A recent survey by a UK bank reveals a concerning trend in the rise of AI-generated voice cloning scams, with 28% of respondents reporting they have been targeted. Voice cloning scams involve criminals using AI technology to create convincing imitations of friends or family members' voices, claiming to be in urgent need of financial assistance. The advancement of AI technology has made it possible to generate realistic voice imitations using as little as three seconds of source material, often easily obtainable from social media videos. These scams represent an evolution of older text-based fraud...

read Sep 19, 2024

Google Unveils 10 New Voices for Gemini AI Assistant

Google expands Gemini's voice capabilities: Google has introduced 10 new voice options for its Gemini Live AI assistant on Android devices, enhancing the user experience and expanding the assistant's functionality. The new voices feature unique personalities and accents, each named after constellations, stars, and star-related phenomena, such as Orion, Capella, and Nova. Initially exclusive to Gemini Advanced subscribers, these voice options are now available to all users, democratizing access to advanced AI features. Google's advanced speech engine enables the voices to be more emotionally expressive and conversational, creating a more natural interaction experience. Enhanced functionality and user experience: Gemini's AI...

read Sep 18, 2024

Hume Launches ‘EVI 2’ AI Voice Model with Emotional Responsiveness

Hume's EVI 2: A leap forward in AI voice technology: Hume, an AI startup, has unveiled Empathic Voice Interface 2 (EVI 2), a significant upgrade to its AI voice model and API, offering enhanced naturalness, emotional responsiveness, and customizability. Key improvements and features: EVI 2 brings substantial enhancements over its predecessor, addressing critical aspects of AI voice interaction. The new version boasts a 40% reduction in latency, with response times ranging from 500 to 800 milliseconds, greatly improving real-time conversation capabilities. EVI 2 demonstrates improved emotional intelligence, better understanding and responding to the emotional context of user inputs. The system...

read Sep 18, 2024

AI Voice Scams Are Surging — Here’s How to Protect Yourself

AI voice-cloning scams pose growing threat: Starling Bank warns that millions could fall victim to fraudsters using artificial intelligence to replicate voices and deceive people into sending money. The UK-based online bank reports that scammers can clone a person's voice from just three seconds of audio found online, such as in social media videos. Fraudsters then use the cloned voice to impersonate the victim and contact their friends or family members, asking for money under false pretenses. Survey reveals alarming trends: A recent study conducted by Starling Bank and Mortar Research highlights the prevalence and potential impact of AI voice-cloning...

read Sep 15, 2024

New AI Tool Lets You Upload Silent Videos and Read Speakers’ Lips

AI-powered lip reading technology debuts: Symphonic Labs, an audio tech startup, has launched an online tool showcasing their AI's lip reading capabilities, potentially revolutionizing speech understanding in various contexts. The San Francisco and Canada-based company specializes in "multimodal speech understanding" tools, with applications ranging from voice calls in noisy environments to whispering commands to voice assistants in public. The startup's new website, readtheirlips.com, allows users to upload short video clips of speakers and receive text transcriptions of what the AI calculates is being said, even when the audio is inaudible. The tool requires clear visibility of the speaker's face and...

read Sep 13, 2024

Google Gemini App Launches Live Voice Mode for Android

Groundbreaking AI interaction: Google's Gemini app has launched its Live Voice Mode for Android users, offering free real-time voice conversations with AI. The new feature allows users to engage in dynamic, interactive dialogues with Gemini, moving beyond simple voice commands or text-based interactions. Live Voice Mode is designed to simulate human-like social interactions, enabling users to explore ideas, brainstorm, and discuss complex topics hands-free. How it works: Gemini's Live Voice Mode is integrated into the Android app, providing users with an intuitive and accessible way to converse with AI. Users can access the feature by updating or downloading the Gemini...

read Sep 12, 2024

Hume’s AI Assistants Bring Emotional Intelligence to LLMs

Hume AI's new "empathic voice interface" EVI 2 brings emotional intelligence to large language models, enabling AI assistants to speak more naturally and respond to users' emotions. Breakthrough in AI communication: EVI 2 adds emotionally expressive voices and emotion recognition capabilities to language models from major tech companies, marking a significant advance in human-AI interaction. The technology allows AI assistants to adopt appropriate emotional tones, such as speaking solemnly when delivering sad news. EVI 2 can measure a range of emotions in users' voices, including determination, anxiety, and happiness. Developers can deploy AI voices with specific emotional states using prompts,...

read Sep 11, 2024

A New Chinese Open-Source AI Is Giving Siri and Alexa a Run for Their Money

LLaMA-Omni, a new AI model developed by researchers at the Chinese Academy of Sciences, is poised to revolutionize how we interact with digital assistants by enabling real-time speech interaction with large language models (LLMs). Breakthrough in voice AI technology: LLaMA-Omni processes spoken instructions and generates both text and speech responses simultaneously, with latency as low as 226 milliseconds. Built on Meta's open-source Llama 3.1 8B Instruct model, LLaMA-Omni supports high-quality speech interactions. The system's low latency rivals human conversation speed, making it a potential game-changer for voice-enabled AI applications. Researchers highlight the growing demand for voice-enabled AI across various sectors,...

read Sep 11, 2024

AI Recreates Darth Vader’s Voice: What It Means for Actors

James Earl Jones' legacy and AI voice replication: The passing of legendary actor James Earl Jones at 93 has sparked discussions about the use of artificial intelligence to recreate his iconic Darth Vader voice. Jones, known for his distinctive voice and powerful performances across stage and screen, had consented to the use of AI to replicate his Darth Vader voice for the 2022 Disney+ show "Obi-Wan Kenobi." Skywalker Sound and Ukrainian company Respeecher employed AI technology to recreate Jones' performance after he stepped away from the role. This decision has raised questions about the future of voice acting and the...

read Sep 10, 2024

New Survey Shows Actors with Unique Voices at Greatest Risk of AI Voice Cloning

AI voice cloning presents significant risks for celebrities: The rapid advancement of AI voice cloning technology has raised concerns about potential misuse, particularly for celebrities with distinctive voices. Survey findings reveal most vulnerable celebrities: Arnold Schwarzenegger tops the list, with 61% of respondents agreeing his voice is the easiest to replicate due to his distinctive accent Donald Trump's voice is considered easy to copy by 39% of respondents, attributed to its familiarity from frequent media appearances Kim Kardashian's consistent tone and pitch make her voice easy to replicate according to 23% of those surveyed Methodology and survey details: The survey,...

read Sep 9, 2024

James Earl Jones’ Passing Marks Milestone for Debate about AI-Generated Voices

Iconic voice preserved through AI: James Earl Jones, who passed away at 93 on Monday, had given Lucasfilm permission to recreate his legendary Darth Vader voice using artificial intelligence. Jones, known for voicing Darth Vader in numerous Star Wars properties, expressed interest in concluding his role as the Sith Lord after providing lines for "The Rise of Skywalker." Lucasfilm partnered with Ukrainian company Respeecher to create an AI-generated version of Jones' Vader voice, based on his past performances. The AI-generated voice was used in the "Obi-Wan Kenobi" series, with Jones' approval to use his archive for training the speech model....

read Sep 9, 2024

Audible Launches Program for Narrators to Create AI-Generated Voices

AI-powered narration comes to Audible: Amazon's Audible is launching a beta program allowing narrators to create AI-generated voice clones, potentially revolutionizing audiobook production while raising questions about the future of human narration. The big picture: Amazon aims to expand its audiobook catalog by enabling faster production through AI technology, while attempting to balance the interests of authors, narrators, publishers, and listeners. The beta program, initially limited to a small group of US-based Audible narrators, will later extend to rights holders such as authors, agents, and publishers. Participants will submit voice recordings to train their AI replicas, retaining control over project...

read Sep 6, 2024

How to Use the ChatGPT Voice Widget on iPhone Lock Screens

AI voice assistants evolve on smartphones: OpenAI's introduction of ChatGPT widgets for iPhone lock screens marks a significant advancement in AI accessibility, particularly with the inclusion of ChatGPT Voice functionality. Key features of ChatGPT widgets: Two widgets are now available for iPhone lock screens One widget provides quick access to ChatGPT Voice, including Advanced Voice for eligible users The feature allows for more immediate AI interactions compared to traditional voice assistants Comparison with existing voice assistants: Siri remains the most deeply integrated assistant for iPhone users Apple is expected to announce upgrades to Siri at the upcoming "Glowtime" event Android...

read Sep 6, 2024

AI-Powered Lenovo Laptop Tracks Users and Responds to Voice

Lenovo's Innovative Auto Twist AI PC Concept: Lenovo unveiled a groundbreaking laptop concept at IFA 2024, featuring a motorized hinge and voice-activated controls for automatic lid and display movement. The Auto Twist AI PC can track user movements and rotate its display to face the user as they move around. Voice commands allow users to open and close the laptop lid without physical interaction. The concept demonstrates advanced AI integration in personal computing devices, potentially improving accessibility for users with limited mobility. Key Features and Functionality: The Auto Twist AI PC's motorized hinge enables a range of movements and positions,...

read Sep 4, 2024

California Mandates Consent for AI Deepfakes of Deceased Stars

California's AI deepfake legislation: The California state Senate has passed AB 1836, a bill requiring explicit consent from the estates of deceased performers for the creation of AI replicas in various media projects. The bill covers all forms of digital recreation using AI, including still images, voice clones, and full character portrayals in films. Producers must obtain agreement from the estate or legal representative of the deceased performer before using their AI replica. This legislation follows the recent passage of AB 2602, which focuses on consent requirements for AI replicas of living performers. Industry support and implications: SAG-AFTRA, the union...

read Sep 4, 2024

ChatGPT May Be Getting 8 New Voices To Enhance AI Interactions

ChatGPT's voice capabilities expand: OpenAI is reportedly developing eight new voices for ChatGPT, enhancing its already impressive voice interaction features and potentially setting a new standard in AI chatbot technology. The new voices, named Fathom, Glimmer, Harp, Maple, Orbit, Rainbow, Reef, Ridge, and Vale, are said to be highly realistic and varied in their sound. A notable improvement is the ability of these voices to mimic animal sounds while reading, addressing a long-standing challenge in AI voice synthesis. The expansion builds upon ChatGPT's existing four lifelike voices, further solidifying its position as a leading AI chatbot in comparison to competitors...

read Aug 27, 2024

D-ID Launches AI Video Translation Tool That Preserves Lip Movements and Voice

AI-powered video translation breakthrough: D-ID has introduced a new feature that can translate videos into 30 different languages while preserving the speaker's voice and lip movements. D-ID, known for its Deep Nostalgia technology, has expanded its AI capabilities to offer video translation services. The new tool clones the speaker's voice and adjusts lip movements to match the translated script, creating a seamless and natural-looking localized video. This technology aims to help content creators and businesses easily reach global audiences without the need for costly and time-consuming traditional dubbing methods. Key features and functionality: The AI Video Translate tool offers a...

read Aug 25, 2024

ElevenLabs New AI Reader App Just Expanded to 32 More Languages

AI-powered text-to-speech expands globally: ElevenLabs, an AI audio startup, has released its Reader app worldwide, significantly expanding its language support and accessibility. The Reader app, initially launched in June for the US, UK, and Canada, is now available globally on both iOS and Android platforms. The app's language capabilities have been expanded to include 32 different languages, such as Spanish, Portuguese, French, Arabic, Mandarin, and Hindi. Users can utilize the app to listen to various text formats, including PDFs, articles, newsletters, and ePub files, though support for Kindle and Apple Books files is not currently available. Diverse voice options and...

read Aug 20, 2024

ChatGPT’s Advanced Voice Feature Is Getting Solid Reviews from Early Testers

Groundbreaking AI voice technology emerges: OpenAI's ChatGPT Advanced Voice feature, currently in limited release to select ChatGPT Plus subscribers, represents a significant leap in natural language AI interactions. Key features and capabilities: Advanced Voice enables seamless speech-to-speech conversations with ChatGPT, offering a range of impressive functionalities that blur the line between AI and human interaction. Users can interrupt and redirect conversations mid-stream, mimicking natural human dialogue patterns. The AI adapts its tone, speed, and energy level to match the user, creating a more personalized and engaging experience. ChatGPT can take on various roles, such as storyteller, city guide, or personal...

read Aug 16, 2024

SAG-AFTRA Partners with Narrativ to Let Actors License Their Voices for AI

SAG-AFTRA's recent agreement with AI startup Narrativ marks a significant step in regulating the use of AI-generated voices in advertising, offering actors more control and compensation for their digital replicas. A new frontier in voice acting: SAG-AFTRA has partnered with Narrativ, an online platform that enables actors to license their voices for use in AI-generated advertisements. The platform allows SAG-AFTRA members to upload 30-60 second audio samples of their voices, which advertisers can then use to create AI-powered commercials. This collaboration represents a novel approach to integrating AI technology into the entertainment industry while protecting actors' rights and interests. The...

read Aug 15, 2024

AI Brain Implant Restores ALS Patient’s Voice

Groundbreaking AI-assisted speech restoration for ALS patient: A revolutionary experiment has successfully used brain implants and artificial intelligence to help an ALS patient regain a semblance of his natural voice. The patient's journey: Casey Harrell, a 46-year-old climate activist and father, gradually lost his ability to perform daily activities and communicate due to the debilitating effects of ALS. ALS, also known as Lou Gehrig's disease, progressively destroyed Mr. Harrell's muscle function, robbing him of cherished activities like walking with his wife and holding his daughter. The loss of speech was particularly devastating for Mr. Harrell, who could no longer sing...

read