×
Nvidia launches NeMo to simplify AI agent creation
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

Nvidia’s NeMo microservices mark a significant evolution in enterprise AI implementation, providing organizations with practical tools to develop AI agents that continuously improve through data interactions. As businesses seek to demonstrate ROI on substantial AI investments, these microservices address a fundamental challenge: creating AI systems that remain relevant by continuously learning from enterprise data and user interactions.

The big picture: Nvidia has released five NeMo microservices designed to help enterprises build AI agents that integrate with business systems and improve through ongoing data interactions.

  • These specialized components create what Nvidia calls a “data flywheel,” enabling AI systems to maintain relevance through continuous exposure to organizational information.
  • Unlike basic chatbots, these agents can perform tasks with minimal human supervision, make autonomous decisions, and access current information within organizational boundaries.

Key components: The toolkit includes five microservices that work together to create AI agents that function as digital teammates.

  • NeMo Customizer enhances LLM fine-tuning with improved training throughput.
  • NeMo Evaluator simplifies assessment of AI models against custom benchmarks.
  • NeMo Guardrails implements safety controls to maintain compliance and appropriate responses.
  • NeMo Retriever enables information access across enterprise systems.
  • NeMo Curator processes and organizes data for model training and improvement.

Technical architecture: The microservices operate as Docker containers orchestrated through Kubernetes, supporting integration with multiple AI models.

  • According to Joey Conway, Nvidia’s senior director of generative AI software for enterprise, “NIMs is used for inference deployments – running the model, questions-in, responses-out. NeMo is focused on how to improve that model.”
  • The containerized approach allows deployment in various environments, addressing data sovereignty and regulatory compliance concerns.
  • The system supports multiple AI models including Meta’s Llama, Microsoft’s Phi family, Google’s Gemma, Mistral, and Nvidia’s Llama Nemotron Ultra.

Real-world impact: Early implementations demonstrate measurable business value across various organizations.

  • Amdocs has developed three specialized agents using the NeMo microservices.
  • AT&T built an agent that processes nearly 10,000 documents updated weekly.
  • Cisco’s Outshift created a coding assistant that delivers faster responses than comparable tools.

Why this matters: The release represents a meaningful step in enterprise AI maturation, bridging the gap between research capabilities and practical business implementations.

  • As organizations move toward production AI systems, tools that simplify the creation of continuously improving models become increasingly valuable.
  • The infrastructure reduces implementation complexity while providing flexibility to address various deployment scenarios and compliance requirements.
Nvidia Releases NeMo Microservices To Streamline AI Agent Development

Recent News

AI slashes compliance time 80% with Relyance’s data ‘x-ray vision’

Relyance's new platform provides visual tracking of data movement across enterprise systems, reducing compliance documentation time while addressing growing regulatory concerns around AI implementation.

DeepSeek’s efficiency breakthrough shakes up the AI race

Chinese AI firm achieves remarkable efficiency with its R1 language model, demonstrating how resource constraints can inspire technical innovations that reduce computing requirements for advanced AI.

The illusion of AI suffering in self-portraits

The dramatic imagery of chains and existential horror reflects predictive patterns from training data rather than actual consciousness or emotional states.