×
Nvidia launches NeMo to simplify AI agent creation
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

Nvidia’s NeMo microservices mark a significant evolution in enterprise AI implementation, providing organizations with practical tools to develop AI agents that continuously improve through data interactions. As businesses seek to demonstrate ROI on substantial AI investments, these microservices address a fundamental challenge: creating AI systems that remain relevant by continuously learning from enterprise data and user interactions.

The big picture: Nvidia has released five NeMo microservices designed to help enterprises build AI agents that integrate with business systems and improve through ongoing data interactions.

  • These specialized components create what Nvidia calls a “data flywheel,” enabling AI systems to maintain relevance through continuous exposure to organizational information.
  • Unlike basic chatbots, these agents can perform tasks with minimal human supervision, make autonomous decisions, and access current information within organizational boundaries.

Key components: The toolkit includes five microservices that work together to create AI agents that function as digital teammates.

  • NeMo Customizer enhances LLM fine-tuning with improved training throughput.
  • NeMo Evaluator simplifies assessment of AI models against custom benchmarks.
  • NeMo Guardrails implements safety controls to maintain compliance and appropriate responses.
  • NeMo Retriever enables information access across enterprise systems.
  • NeMo Curator processes and organizes data for model training and improvement.

Technical architecture: The microservices operate as Docker containers orchestrated through Kubernetes, supporting integration with multiple AI models.

  • According to Joey Conway, Nvidia’s senior director of generative AI software for enterprise, “NIMs is used for inference deployments – running the model, questions-in, responses-out. NeMo is focused on how to improve that model.”
  • The containerized approach allows deployment in various environments, addressing data sovereignty and regulatory compliance concerns.
  • The system supports multiple AI models including Meta’s Llama, Microsoft’s Phi family, Google’s Gemma, Mistral, and Nvidia’s Llama Nemotron Ultra.

Real-world impact: Early implementations demonstrate measurable business value across various organizations.

  • Amdocs has developed three specialized agents using the NeMo microservices.
  • AT&T built an agent that processes nearly 10,000 documents updated weekly.
  • Cisco’s Outshift created a coding assistant that delivers faster responses than comparable tools.

Why this matters: The release represents a meaningful step in enterprise AI maturation, bridging the gap between research capabilities and practical business implementations.

  • As organizations move toward production AI systems, tools that simplify the creation of continuously improving models become increasingly valuable.
  • The infrastructure reduces implementation complexity while providing flexibility to address various deployment scenarios and compliance requirements.
Nvidia Releases NeMo Microservices To Streamline AI Agent Development

Recent News

The role of AI in shaping future scientific breakthroughs

AI is moving beyond data analysis into conducting physical experiments, enabling it to learn causality and develop tacit knowledge crucial for autonomous scientific discovery.

How AI is shaping a new era of introspection

Interactive AI systems are creating a new paradigm for self-reflection by serving as cognitive mirrors rather than distractions from deep thinking.

Hostinger Horizons simplifies web development for non-coders

The AI-powered platform generates functioning web applications from natural language descriptions, removing technical barriers for non-programmers.