×
AI voice app Hume impresses but still needs improvement
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

Hume AI’s new voice app enters the conversational AI arena: Hume AI has launched a browser-based AI voice application, leveraging its proprietary speech-language model EVI 2 and supplementary language models like Anthropic’s Claude 3.5 Haiku.

Key features and functionality: The app offers a diverse range of AI voices, each with distinct personalities and specialized knowledge domains, catering to various user needs.

  • Users can interact with the AI voices through their microphone without requiring an account, enhancing accessibility.
  • The application includes specialized AI personas for quick answers, philosophical discussions, and storytelling, among others.
  • Hume AI’s approach combines its own technology with established language models, potentially offering a unique user experience.

User experience and performance: Initial testing reveals a mix of impressive capabilities and areas requiring improvement, highlighting the app’s potential and current limitations.

  • The Quick Answers bot demonstrated accuracy in providing information, showcasing the app’s ability to handle factual queries effectively.
  • The Storytelling bot exhibited some issues, including repetition and unexpected voice changes, indicating room for refinement in narrative generation.
  • Users noted some overlap in responses between different AI personas, suggesting a need for more distinct specialization among the voices.
  • Occasional repetition of dialogue lines by some bots was observed, pointing to areas where conversational flow could be enhanced.

Competitive landscape: Hume AI’s voice app enters a market where established players like OpenAI’s ChatGPT Advanced Voice model already have a presence.

  • The app’s browser-based accessibility and variety of specialized voices could potentially differentiate it from existing offerings.
  • As a new entrant, Hume AI faces the challenge of competing with more established AI conversational platforms while addressing its current limitations.

Technical implementation: The combination of Hume AI’s EVI 2 model with other language models represents an interesting approach to voice AI technology.

  • This hybrid model potentially allows Hume AI to leverage its strengths in speech processing while benefiting from the broader capabilities of established language models.
  • The effectiveness of this approach in delivering a seamless and coherent user experience remains to be fully proven.

Potential impact and future prospects: While showing promise, the app’s success will likely depend on Hume AI’s ability to refine and improve its offering.

  • Addressing issues like repetition, voice consistency, and clearer differentiation between AI personas could significantly enhance the user experience.
  • The app’s potential to compete with established players in the voice AI space will depend on continuous improvement and possibly the introduction of unique features or use cases.

Analyzing deeper: As voice AI technology continues to evolve, Hume AI’s entry into this space reflects the growing interest in more natural and accessible human-AI interactions. The success of such applications could influence how we interact with AI in daily life, potentially shifting expectations for digital assistants and information retrieval systems. However, the path from promising prototype to widely adopted tool often requires significant refinement and user-centric improvements.

I tried Hume's new AI voice app — it's good but needs some work

Recent News

New benchmark evaluates AI agents and humans on research capabilities

AI systems excel at rapid testing but fall short of human experts in complex machine learning tasks requiring sustained problem-solving and knowledge integration.

MongoDB introduces new AI and data integrations through Microsoft partnership

MongoDB's cloud database now connects directly with Microsoft's AI tools across 48 global regions, simplifying how companies build and deploy custom AI applications.

Microsoft’s delayed Recall AI is finally available on Copilot Plus PCs

Microsoft's AI-powered search tool creates secure, locally-stored archives of PC activity, allowing users to find and extract content from their digital histories using natural language queries.