News/Voice

May 20, 2025

AI voice scams target US officials at federal, state level to steal data

The FBI is warning about sophisticated smishing and vishing campaigns targeting current and former government officials that use AI-generated voices and social engineering techniques to steal sensitive information. This escalation represents a concerning evolution in government-targeted scams, as cybercriminals impersonate senior officials to establish trust before directing victims to malicious links that compromise personal accounts. The big picture: Since April, cybercriminals have been targeting U.S. federal and state employees with texts and AI-generated voice messages that impersonate senior officials to establish rapport and ultimately gain access to sensitive information. Once scammers compromise one account, they use the stolen information to target additional...

May 20, 2025

Actors’ union bypassed by Epic Games in use of AI-voiced Darth Vader

SAG-AFTRA's complaint against Epic Games over AI-generated Darth Vader voice marks a significant escalation in the ongoing tensions between entertainment unions and game developers. This case highlights the evolving battle over AI voice rights in creative industries, where technological capabilities are outpacing established contractual frameworks. The dispute illustrates how AI voice cloning is creating new legal and ethical challenges around performer rights and proper compensation. The big picture: SAG-AFTRA has filed an Unfair Labor Practice complaint against Epic Games for using an AI-generated version of James Earl Jones' Darth Vader voice in Fortnite without union negotiation. Epic Games acquired rights...

May 19, 2025

AI assistant Gemini coming to Samsung and Sony earbuds

Google plans to expand Gemini AI assistant access to various audio devices across its ecosystem, marking a strategic shift away from Google Assistant as the company pushes its newer AI technology to more hardware. This expansion brings enhanced AI capabilities to earbuds and wearables, allowing for more sophisticated interactions than previously possible with Google Assistant. The big picture: Samsung will bring Gemini to its Galaxy Buds 3 and Galaxy Watches in the "coming months," while Sony earbuds will also gain Gemini integration as part of Google's broader expansion to cars, TVs, and Pixel Watches. Key capabilities: Gemini on these audio...

May 19, 2025

Whisper AI transcribes up to 8x faster with new Inference Endpoints

Hugging Face has launched a dramatically improved Whisper model deployment option on Inference Endpoints, delivering up to 8x faster performance for audio transcription services. This advancement makes powerful transcription capabilities more accessible and cost-effective, bringing enterprise-grade speech recognition within reach of more organizations through optimized open-source technology. The big picture: Hugging Face's new Whisper deployment leverages the open-source vLLM project to achieve substantial performance gains without sacrificing transcription quality. The solution specifically targets audio transcription efficiency using Whisper Large V3, which demonstrates nearly 8x improvement in real-time factor (RTFx) compared to previous versions. Word Error Rate (WER) evaluations across eight...

May 19, 2025

Siri, why are you procrastinating? Apple may delay assistant upgrades until after WWDC

Apple's troubled AI voice assistant development is facing further delays, with no plans to showcase improvements at next month's WWDC event. This setback comes amid internal turmoil described by some Apple employees as a "crisis," highlighting the growing gap between Apple's AI capabilities and those of its competitors. The strategic shift toward a new LLM-based Siri architecture demonstrates Apple's acknowledgment that fundamental changes are needed to keep pace in the increasingly competitive AI assistant landscape. The big picture: Apple will not present any Siri voice assistant improvements at WWDC in June, focusing instead on other Apple Intelligence features like the...

May 19, 2025

NBC adopts AI-generated voice for NBA broadcasts

NBC's decision to resurrect the voice of deceased sportscaster Jim Fagan using AI for its upcoming NBA coverage marks a significant advancement in synthetic media use in mainstream broadcasting. By digitally recreating the voice of a beloved narrator who died in 2017, NBC is blending nostalgia with cutting-edge technology to enhance its basketball programming. This development highlights the growing trend of posthumous digital recreations in entertainment and raises important questions about the future relationship between AI-generated content and traditional media production. The big picture: NBC will use an AI-generated version of Jim Fagan's voice across selected NBA content beginning October...

May 17, 2025

AI-powered Darth Vader shocks fans with unexpected profanity

Artificial intelligence voice technology faces a new embarrassing mishap as Darth Vader's character in Fortnite temporarily spewed profanity and inappropriate language. The incident highlights ongoing challenges with AI voice implementations in mainstream entertainment, exposing the risks of deploying synthetic voice technology based on iconic performers in interactive environments where user input can trigger unexpected responses. The incident: Epic Games' Fortnite featured an AI-powered Darth Vader that briefly responded to players with profanity and inappropriate language. A viral exchange occurred when Twitch streamer Loserfruit triggered the character to repeat expletives, with Vader responding, "Such vulgarity does not become you, Padme." According...

May 15, 2025

AI narration impresses on Audible, but human voices still reign

Audible's introduction of AI-powered narration technology represents a significant shift in audiobook production, creating tension between accessibility and artistic quality in digital storytelling. While this technology promises to transform millions of unrecorded books into accessible audio formats, it raises important questions about the value of human performance, the future of voice acting careers, and where automation should complement rather than replace human artistry in creative industries. The big picture: Audible is rolling out a fully integrated AI production pipeline that can auto-generate entire audiobooks with synthetic voices, targeting publishers looking for faster and cheaper alternatives to human narration. The technology...

May 15, 2025

Microsoft tests “Hey, Copilot!” voice activation in Windows

Microsoft is expanding voice control capabilities in Windows 11 with a new wake word feature that allows users to activate Copilot through voice commands alone. This update represents Microsoft's latest effort to fully integrate its AI assistant into the Windows experience, following in the footsteps of voice-activated assistants from Apple, Google, and Amazon while building upon capabilities previously offered by Microsoft's discontinued Cortana assistant. The big picture: Microsoft is adding a "Hey Copilot!" wake word feature to Windows 11, allowing users to activate the AI assistant hands-free. The update is currently rolling out to Windows Insiders worldwide who have English...

May 13, 2025

What’s your pleasure? Spotify AI DJ now responds to user song requests

Spotify is expanding the functionality of its AI DJ feature by introducing voice command capability, giving Premium subscribers greater control over their music experience. This update transforms the DJ feature from a passive, algorithm-driven playlist generator into an interactive assistant that can respond to specific requests for artists, genres, moods, and even conceptual music experiences. The enhancement addresses a key limitation of the original feature by allowing users to actively shape their listening sessions rather than simply accepting the AI's selections. How it works: Spotify Premium users can now use English voice commands to personalize their AI DJ experience by...

May 13, 2025

New Meta AI app unifies smart glasses and phone experiences

Meta's new standalone AI assistant app represents a significant evolution in how users can interact with conversational AI across platforms. Built on the Llama 4 model, this personalized assistant integrates with Meta's ecosystem while introducing voice capabilities and social discovery features. The app's design bridges the gap between Meta's smart glasses and other devices, creating a unified experience that maintains conversations across platforms. The big picture: Meta has launched its first standalone Meta AI app across iOS, Android, web browsers, and Ray-Ban Meta smart glasses with personalization features and enhanced voice capabilities. The app leverages Meta's Llama 4 model to...

May 9, 2025

AI headphones clone multiple voices for real-time translation

Yes, please do listen to the voices in your head. Researchers have developed a groundbreaking AI headphone system that can simultaneously translate multiple speakers in real time, potentially eliminating language barriers in multilingual group conversations. The Spatial Speech Translation system not only converts foreign languages into English but also preserves each speaker's unique vocal characteristics and emotional tone, creating a more natural translation experience than existing technologies. This innovation could transform international communication by enabling people to confidently express themselves across language divides. How it works: The University of Washington's Spatial Speech Translation system uses AI to track and translate...

May 8, 2025

How to claim your payout as Apple settles $95M Siri lawsuit

Apple's decision to settle a $95 million lawsuit over Siri's unauthorized voice recordings marks a significant development in the growing tension between voice assistant technology and privacy rights. The settlement offers compensation to millions of Apple users whose private conversations may have been inadvertently captured by Siri between 2014 and 2024, highlighting how even accidental data collection can trigger substantial legal consequences for tech companies. The big picture: Apple has agreed to pay $95 million to settle claims that Siri recorded private conversations without user consent, potentially affecting millions of customers who owned Siri-enabled devices over a ten-year period. The...

May 8, 2025

Is voice-driven ambient AI our future, as Zuckerberg predicts?

The convergence of voice computing and AI represents a potential paradigm shift in personal computing, with both Meta CEO Mark Zuckerberg and Apple SVP Eddy Cue pointing toward a future where typing may no longer dominate our digital interactions. This alignment between two tech giants that often clash suggests a shared vision of how AI could fundamentally transform how humans interact with technology in the coming decade. The big picture: Mark Zuckerberg highlighted at LlamaCon that voice interaction is currently "under-indexed," with 95% of digital interaction being text-based despite voice likely playing a much larger role in the future. Why...

May 7, 2025

AI-powered signing avatars bring communication breakthrough for deaf users

Silence Speaks is pioneering AI-powered sign language translation to bridge critical communication gaps for deaf and hard-of-hearing communities. This British startup addresses a global challenge affecting over 70 million sign language users who face isolation in noisy environments like train stations, hospitals, and classrooms where background noise and crosstalk make understanding speech nearly impossible. By developing an avatar that accurately translates text to British Sign Language (BSL) with emotional nuance, the company aims to transform accessibility for the 150,000+ BSL users in the UK. The big picture: Silence Speaks has created an AI-powered avatar that translates text into British Sign...

May 7, 2025

AI revives Jim Fagan’s voice for NBC’s NBA comeback

NBC Sports is reviving a voice from its basketball broadcasting past through AI technology, setting the stage for its NBA comeback after more than two decades. The network announced it will use an AI-generated voice of the late Jim Fagan, who died in 2017, for its NBA coverage beginning in October 2025 when NBC's new 11-year media rights deal takes effect. This nostalgic move aims to recreate the iconic sound that defined NBC's NBA coverage during the Jordan-era 1990s, while raising questions about how networks will blend tradition with technology in sports broadcasting. The big picture: NBC Sports is employing...

May 5, 2025

RealtimeVoiceChat enables natural AI conversations on GitHub

Real-time voice chat technology is advancing rapidly, enabling natural-sounding AI conversations with minimal latency. This open-source project demonstrates how sophisticated speech recognition, large language models, and text-to-speech systems can be integrated to create fluid, interruptible voice interactions that mimic human conversation patterns, showcasing the potential for more intuitive human-AI interfaces. Key features of this real-time AI voice chat system: 1. End-to-end voice conversation architecture: The system creates a complete voice interaction loop by capturing user speech through the browser, processing it server-side, and returning AI-generated speech. This architecture prioritizes low latency and natural conversational flow above all else. 2. Real-time processing...

May 1, 2025

HitPaw offers 60% discount in Mother’s Day promotion

HitPaw's Mother's Day sales campaign offers substantial discounts on its AI creative tools, potentially helping users create personalized gifts while saving money. The limited-time promotion combines significant price reductions with a social media giveaway, creating multiple incentives for consumers to engage with the brand during this seasonal shopping period. The big picture: HitPaw is running a Mother's Day promotion from April 29 to May 16, 2025, offering up to 60% discounts on its AI-powered creative software suite and a chance to win prizes through a social media giveaway. Key details: The promotion includes discounts on HitPaw's entire product line, which...

Apr 30, 2025

Google AI podcast tool speaks 50 languages, including Zulu

Google's NotebookLM is expanding its popular AI-powered "Audio Overview" feature to 50 additional languages, addressing a growing demand for multilingual content transformation tools. This significant update to Google's experimental AI notebook allows users to convert text into podcast-style conversations between AI hosts in languages ranging from Spanish and Zulu to Afrikaans, powered by Gemini 2.5 Pro. The feature's ability to transform dense content into engaging, conversational formats has important implications for education, language learning, and cross-cultural content accessibility. The big picture: Google is dramatically expanding the reach of NotebookLM's viral Audio Overview feature by adding support for 50 new languages,...

Apr 28, 2025

AI presentation voiceovers: Free tool enhances boring ol’ slide decks

AI Presentation Narrator represents a new tool for quickly transforming static slide decks into professional-sounding videos with AI-generated voiceovers. This advancement could significantly impact how professionals and educators create and share presentation content, potentially saving hours of recording time while maintaining a polished delivery that traditionally required voice talent or extensive personal recording sessions. The big picture: AI Presentation Narrator enables users to convert PowerPoint or other slide deck formats into fully narrated video presentations without manual recording. How it works: The tool processes uploaded slide presentations and automatically generates natural-sounding voiceovers based on the text content within each slide....

Apr 26, 2025

AI calms panic attack: User shares ChatGPT experience

ChatGPT is demonstrating its versatility beyond traditional use cases, showing meaningful applications for mental health support during moments of crisis. A tech journalist's personal experience reveals how AI voice technology provided unexpected comfort during a panic attack, highlighting new dimensions of human-AI interaction in emotionally vulnerable situations. The unexpected helper: ChatGPT Voice and Vision provided real-time assistance during a panic attack when the author was alone between work meetings. The AI recognized the situation after some initial confusion and quickly shifted into a supportive role, guiding the user through breathing exercises and grounding techniques. Using a calm, steady voice, the...

Apr 26, 2025

All Voice Lab pushes AI voice generation to new heights

The artificial intelligence voice generation landscape continues to evolve with increasingly sophisticated offerings aimed at creators and developers. All Voice Lab represents a significant advancement in text-to-speech technology, bringing ultra-realistic voice capabilities through its state-of-the-art MaskGCT 2.0 model. As AI-generated voices become increasingly indistinguishable from human speech, platforms offering both accessibility and high fidelity are positioned to transform audio content creation across multiple industries. The big picture: All Voice Lab has launched as a comprehensive AI voice platform offering ultra-realistic text-to-speech and voice cloning capabilities powered by advanced deep learning technology. The platform utilizes the state-of-the-art MaskGCT 2.0 model, suggesting...

Apr 25, 2025

AI has found its voice — and it can scream

Dia, a new open-source AI voice model from Nari Labs, breaks new ground by mastering emotional expression in synthetic speech, particularly excelling at realistic screaming. This development signifies a pivotal shift in AI voice technology as the industry moves beyond merely sounding human to convincingly expressing the full spectrum of human emotion, potentially transforming how AI assistants, customer support bots, and entertainment applications connect with users. The innovation gap: Dia distinguishes itself by tackling a challenging aspect of speech synthesis that major AI voice models have largely overlooked. Most commercial AI voices achieve naturalness by smoothing tone, which inadvertently limits...

Apr 25, 2025

Perplexity launches AI voice assistant to rival Siri on iOS

Perplexity's new Voice Assistant feature brings powerful AI capabilities to iOS users, offering a compelling alternative to Apple's delayed Siri enhancements. This update significantly expands accessibility to conversational AI on Apple devices, notably including older iPhones and iPads that won't support Apple Intelligence, while providing sophisticated integration with various apps and services. The big picture: Perplexity has updated its iOS app with a new Voice Assistant feature that allows users to interact with AI through voice commands even when navigating away from the app. The assistant can perform complex tasks across multiple apps, including drafting emails, controlling smart home devices,...
