News/Agents

Apr 2, 2025

AI race intensifies as Zoom, Cerence, and tech giants expand capabilities

Major AI players are aggressively expanding their capabilities, with Zoom embracing agentic AI, Cerence targeting in-car assistants, and tech giants forming powerful infrastructure partnerships. This flurry of activity demonstrates how AI development is accelerating across industries, from enhancing workplace productivity to transforming customer experiences in retail and automotive environments. The big picture: Zoom is transforming its AI Companion into an agentic tool capable of autonomous task execution and decision support across its entire product portfolio. The updates were announced at Enterprise Connect in Orlando and will impact applications including Zoom Meetings, Team Chat, and Docs. According to Chief Product Officer...

read
Apr 1, 2025

New framework prevents AI agents from taking unsafe actions in enterprise settings

Singapore Management University researchers have developed a promising solution to a critical challenge facing AI agents in enterprise settings. AgentSpec presents a new approach to improving agent reliability and safety by creating a structured framework that constrains AI agents to operate only within specifically defined parameters—addressing a major barrier to enterprise adoption of more autonomous AI systems. The big picture: AgentSpec is a domain-specific framework that intercepts AI agent behaviors during execution, allowing users to define structured safety rules that prevent unintended actions without altering the core agent logic. The approach has proven highly effective in preliminary testing, preventing over...

read
Apr 1, 2025

Singapore researchers create “ambient agents” framework to control agentic AI with 90% safety improvement

Singapore Management University researchers have created a framework that significantly improves AI agent safety and reliability, addressing a critical obstacle to enterprise automation. Their approach, AgentSpec, provides a structured way to control agent behavior by defining specific rules and constraints—preventing unwanted actions while maintaining agent functionality. The big picture: AgentSpec tackles the fundamental challenge that has limited AI agent adoption in enterprises—their tendency to take unintended actions and difficulty in controlling their behavior. The framework acts as a runtime enforcement layer that intercepts agent behavior and applies safety rules set by humans or generated through prompts. Tests show AgentSpec prevented...

read
Mar 31, 2025

Executive high function: Microsoft’s new AI agents will give execs data science powers by year’s end

Microsoft is pushing the boundaries of executive decision-making with two new AI reasoning agents that represent a significant leap in workplace automation. The company's Researcher and Analyst agents, built on OpenAI's advanced models and integrated into Microsoft 365 Copilot, are designed to transform how business leaders process information and extract insights from complex data. These tools, available starting April 2025 through a new Frontier early access program, promise to fundamentally change how executives interact with their organization's information ecosystem. The big picture: Microsoft's two new AI reasoning agents within Microsoft 365 Copilot represent a strategic advancement in AI-powered workplace tools...

read
Mar 31, 2025

Companies investing 3x more in AI tech than corresponding human talent, according to new study

Agentic AI is emerging as the next evolutionary leap in enterprise automation despite generative AI's still-developing business impact. Accenture's extensive research involving 3,400 executives and 2,000 client projects reveals only 13% of current AI initiatives deliver significant business value. This talent and skills gap becomes even more critical as organizations race toward implementing AI agents—automated systems designed to perform specific tasks with increasing autonomy—requiring companies to develop specialized expertise in both AI technology and business applications. The big picture: Organizations are significantly underinvesting in human talent needed to effectively implement AI, spending three times more on technology than on people....

read
Mar 31, 2025

Study shows type safety and toolchains are key to AI success in full-stack development

Autonomous AI agents are showing significant progress in complex coding tasks, but full-stack development remains a challenging frontier that requires robust evaluation frameworks and guardrails to succeed. New benchmarking research reveals how model selection, type safety, and toolchain integration affect AI's ability to build complete applications, offering practical insights for both hobbyist developers and professional teams creating AI-powered development tools. The big picture: In a recent a16z podcast, Convex Chief Scientist Sujay Jayakar shared findings from Fullstack-Bench, a new framework for evaluating AI agents' capabilities in comprehensive software development tasks. Why this matters: Full-stack coding represents one of the most...

read
Mar 28, 2025

Precocious AI: Stanford’s open-source NNetNav agent rivals GPT-4 while learning like a child

Stanford researchers have developed NNetNav, an open-source AI agent that can perform tasks on websites by learning through exploration, similar to how children learn. This development comes as major tech companies like OpenAI, ByteDance, and Anthropic are releasing commercial AI agents that can take actions online on behalf of users. NNetNav addresses key concerns about proprietary AI systems by being fully transparent, more efficient, and equally capable while remaining completely open source. The big picture: Stanford graduate student Shikhar Murty and professor Chris Manning have created an AI system that can reduce the burden of repetitive computer tasks while addressing...

read
Mar 26, 2025

Observe.AI launches VoiceAI agents to automate routine customer service calls

Observe.AI's launch of VoiceAI agents represents a significant advancement in contact center automation, blending various AI technologies to handle routine customer interactions. This solution addresses growing enterprise interest in automating basic customer service tasks while allowing human agents to focus on more complex issues, potentially transforming the economics and experience of customer service operations. The big picture: Observe.AI has released VoiceAI agents to automate routine customer service interactions, positioning itself as the only complete AI-powered platform supporting the entire customer journey. The new solution is designed to handle everything from simple FAQs to multi-step conversations, eliminating long hold times and...

read
Mar 25, 2025

Going with the enterprise workflow: Domo launches Agent Catalyst as AI agents reshape 2025

AI agents are rapidly entering enterprise environments in 2025, with Domo's newly launched Agent Catalyst joining offerings from Microsoft and Salesforce in enabling autonomous workflows. Despite their growing capabilities, these AI agents still require significant human collaboration to function effectively. Domo's platform demonstrates the evolution of this technology while highlighting the importance of employee adoption, psychological safety, and data literacy for successful implementation. The big picture: Domo has unveiled Agent Catalyst, an AI agent capability that allows enterprises to build end-to-end autonomous workflows that both analyze data and complete actions without human intervention. To promote adoption, Domo is offering to...

read
Mar 24, 2025

Resend CEO: How designing for AI agents is reshaping developer tools and email

The intersection of AI agents and developer tools is creating new paradigms for building software and communication platforms. In a recent a16z podcast, Resend CEO Zeno Rocha explores how generative AI is transforming email experiences, introducing the concept of "agent experience" as a natural evolution of developer experience, and how technologies like Multi-Agent Collaboration Protocol (MCP) are reshaping programming fundamentals. The big picture: Developer experience expert Zeno Rocha reveals how his focus has shifted to designing for "agent experience" – creating interfaces and APIs optimized for AI agents that both build and operate within modern software products. Email's AI transformation:...

read
Mar 24, 2025

Pokémon No-Go: Claude’s advanced AI struggles to navigate Pokémon Red despite 3.7 upgrade

Anthropic's advanced AI agent Claude 3.7 Sonnet is struggling to complete the decades-old children's game Pokémon Red, despite being one of the industry's most sophisticated AI models. This experiment highlights the significant gap between current AI capabilities and true autonomous agency, as Claude's difficulties with basic visual processing and navigation demonstrate that even advanced language models still face fundamental challenges when interacting with virtual environments. The big picture: Anthropic is livestreaming "Claude Plays Pokémon" as a demonstration of AI agent capabilities, but progress has been painfully slow and inconsistent. Claude has managed to obtain three Gym badges and reach Cerulean...

read
Mar 19, 2025

How enterprises are balancing agentic automation with human oversight

Agentic AI is emerging as a transformational approach for enterprises seeking to integrate intelligent automation into their operations. While conceptually straightforward—deploying large language models as modular workflow components with human oversight—the practical implementation requires careful planning and strategic execution. Organizations that effectively balance AI capabilities with human expertise will gain competitive advantages in productivity, but must navigate ethical concerns and establish clear boundaries for these increasingly capable AI systems. The big picture: Agentic AI represents a fundamental shift in how organizations approach human-machine collaboration, focusing on AI handling routine tasks while empowering humans to apply creativity and critical thinking. The...

read
Mar 19, 2025

Swedish startup creates robot dog that learns like animals, not algorithms

Swedish startup IntuiCell has created a revolutionary robot dog called Luna with a digital nervous system that learns and adapts naturally like living organisms rather than relying on massive datasets or pre-training. This represents one of the first practical applications of physical agentic AI—artificial intelligence capable of making decisions and taking actions toward specific goals autonomously—and could transform how robots learn to navigate unpredictable environments, from space exploration to disaster response. The big picture: IntuiCell is pioneering an entirely different approach to robot learning by creating machines with nervous systems that learn through real-world interactions rather than pre-programmed responses or...

read
Mar 19, 2025

Got agency? Deloitte launches Zora AI platform with Nvidia to automate complex business tasks

Deloitte's new Zora AI platform represents a significant advancement in business automation technology, bringing autonomous AI agents powered by Nvidia's sophisticated models to enterprise operations. This launch signals the accelerating trend of purpose-built AI agents designed to handle complex business processes across finance, supply chain, and customer service sectors. As organizations increasingly seek intelligent automation to augment human capabilities, Zora AI's early deployment results—including substantial cost reductions and productivity gains—offer compelling evidence for the potential business impact of agentic AI systems. The big picture: Deloitte has unveiled Zora AI, an advanced platform featuring ready-to-deploy digital agents built on Nvidia's AI...

read
Mar 18, 2025

Keeping it real: 5 crucial business functions that should stay human in the AI era

In a business landscape increasingly dominated by AI automation, entrepreneurs face a crucial choice about which aspects of their operations should remain distinctly human. While AI tools promise efficiency and scalability, blindly outsourcing core business functions risks creating generic, forgettable companies that lose the authentic human elements that make them special. Understanding which tasks to entrust to humans versus AI has become a critical strategic decision that directly impacts a company's competitive advantage and brand identity. The big picture: Businesses are rapidly adopting AI agents to automate various functions, but this trend risks homogenizing companies and removing the unique elements...

read
Mar 18, 2025

Manus challenges OpenAI’s Operator with autonomous AI agent for complex tasks

Manus represents a new entry in the autonomous AI agent space, positioning itself as a competitor to OpenAI's Operator by promising to handle complex tasks without continuous human guidance. This technology aims to revolutionize productivity by automating everything from resume screening to real estate research, though the article's author maintains healthy skepticism about its real-world performance compared to established tools like ChatGPT Pro. What are AI agents: AI agents represent sophisticated automated assistants that go beyond simple chatbot functionality by making decisions and taking actions independently. These systems leverage reinforcement learning, neural networks, and natural language processing to process inputs...

read
Mar 18, 2025

Halliday raises $20M to build secure AI agents for blockchain networks

Halliday's $20 million Series A funding represents a significant step toward solving one of blockchain's most critical challenges: safely deploying autonomous AI agents in decentralized environments. Led by Andreessen Horowitz's crypto arm, this investment supports Halliday's innovative Agentic Workflow Protocol, which creates immutable safety guardrails for AI operating on blockchain networks. The technology addresses the fundamental issue of how AI can interact with financial systems while maintaining security—a crucial development for enterprises seeking to leverage AI in blockchain applications without risking costly and irreversible errors. The big picture: Halliday has raised $20 million in Series A funding led by a16z...

read
Mar 17, 2025

Qualtric Control: Company launches AI Experience Agents to solve customer problems autonomously

Qualtrics is revolutionizing customer experience management with AI agents that can autonomously resolve issues across multiple touchpoints. Unveiled ahead of the company's X4 2025 Experience Management Summit, these "Experience Agents" represent a significant evolution in AI capabilities by directly interacting with customers rather than merely streamlining internal processes. This development signals a shift in how businesses might handle customer service, combining speed with empathetic engagement that traditional chatbots often fail to deliver. The big picture: Qualtrics has developed AI agents that go beyond answering questions to actively resolve customer issues by interacting directly with consumers across various touchpoints including surveys,...

read
Mar 17, 2025

Mid-collar concerns: AI companies pivot to autonomous systems designed to replace human workers

Major AI companies are racing to deliver autonomous AI systems capable of performing complex tasks with minimal human supervision. This evolution of AI from merely answering questions to completing multi-step tasks independently represents a significant shift in the technology landscape. These new agent-like systems, designed to reason and work autonomously, could dramatically reshape productivity expectations and the future job market. The big picture: Silicon Valley's AI powerhouses including OpenAI, Anthropic, Google, and Microsoft are pivoting toward AI systems that can independently complete tasks rather than just augment human capabilities. Anthropic's Claude Code can perform much of a software developer's work...

read
Mar 14, 2025

7 steps to build your own custom ChatGPT AI agent for business automation

The creation of custom AI agents using ChatGPT represents a significant shift in project management, offering teams a way to automate routine tasks while focusing on higher-value work. With most organizations planning to integrate AI agents within three years, understanding how to build these custom solutions has become increasingly relevant for businesses seeking to improve efficiency and productivity in an environment where manual tracking and reporting consume valuable time. 1. Define your AI agent's purpose Before building an AI agent, you must clearly establish what problem it will solve and what specific tasks it will perform. This foundational step ensures...

read
Mar 14, 2025

SoftBank to build $677 million AI data center with OpenAI for Japanese market

SoftBank's plan to transform a former Sharp LCD plant into a massive AI data center marks a significant step in Japan's AI infrastructure development. The $677 million acquisition of the Osaka facility represents a strategic collaboration with OpenAI to commercialize AI agents specifically for the Japanese market, potentially triggering a new wave of localized AI deployment and customization in Asia's advanced economies. The big picture: SoftBank intends to convert Sharp's closed TV LCD factory in Osaka into one of Japan's largest data centers through a ¥100 billion ($677 million) investment, according to a Nikkei report. The facility will power AI...

read
Mar 13, 2025

ServiceNow expands AI agent capabilities with Moveworks acquisition and new enterprise tools

Move it or lose it! Moveworks, that is... ServiceNow is aggressively expanding its AI agent capabilities through strategic acquisitions and platform enhancements to deliver more value across enterprise operations. By acquiring Moveworks and introducing specialized agents for security, change management, and network operations, the company aims to streamline workflows and automate repetitive tasks. This expansion reflects ServiceNow's vision that AI agents are becoming essential infrastructure components for modern businesses rather than just optional tools. The big picture: ServiceNow announced the acquisition of Moveworks alongside new agent capabilities and the general availability of its orchestration platform, signaling deeper investment in its...

read
Mar 12, 2025

Yo Quiero Taco Bell AI: Fast food icon embraces agentic automation

Agentic AI is moving from theoretical applications to real-world implementation in the fast food industry as YUM Brands unveils virtual management technology for Taco Bell restaurants. This significant development represents the next evolution of AI beyond generative chatbots, creating systems capable of handling complex, multi-step tasks with minimal human intervention. As a trillion-dollar industry driven by technological innovation, fast food is becoming a proving ground for AI that can make decisions, manage operations, and potentially reshape how restaurants function. The big picture: YUM Brands is deploying agentic AI restaurant managers across its Taco Bell franchise through its Byte By Yum...

read
Mar 12, 2025

Manus AI agent put to the test, outperforms single-system chatbots

Manus, Butterfly Effect's new AI agent, marks a significant evolution in autonomous AI capabilities by coordinating multiple AI models to handle complex tasks. Unlike single-model chatbots, this general AI agent distributes work across independently operating systems, creating what early users describe as a highly capable digital assistant. Despite minimal access—fewer than 1% of waitlisted users have received invites—Manus has already generated substantial buzz in both Chinese and Western tech communities, with influential figures like Jack Dorsey praising its performance. The big picture: Manus represents a different approach to AI by functioning as a multi-model agent system rather than relying on...

read
Load More