×
NVIDIA Launches AI Microservices for Japan and Taiwan Markets
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

AI localization advances: NVIDIA has launched four new NIM microservices to accelerate the deployment of sovereign AI applications with enhanced cultural and language fluency in Japan and Taiwan.

  • The new microservices support popular community models tailored to meet regional needs, improving user interactions through more accurate understanding and responses based on local languages and cultural heritage.
  • This initiative aligns with the growing global trend of nations pursuing sovereign AI to ensure AI systems are in harmony with local values, laws, and interests.
  • ABI Research projects that generative AI software revenue in the Asia-Pacific region alone is expected to surge from $5 billion in 2024 to $48 billion by 2030, highlighting the significant market potential.

Specialized language models: The new NIM microservices include region-specific models trained on local data to provide deeper understanding of cultural nuances and linguistic subtleties.

  • Llama-3-Swallow-70B, trained on Japanese data, and Llama-3-Taiwan-70B, trained on Mandarin data, offer enhanced comprehension of local laws, regulations, and customs.
  • The RakutenAI 7B family of models, built on Mistral-7B and trained on English and Japanese datasets, is available as two separate NIM microservices for Chat and Instruct functionalities.
  • These models have demonstrated superior performance in regional language understanding, legal tasks, question-answering, and language translation compared to base LLMs like Llama 3.

Deployment and performance: The new NIM microservices, available with NVIDIA AI Enterprise, are designed for easy deployment and optimized performance.

  • The microservices are optimized for inference using the NVIDIA TensorRT-LLM open-source library, providing up to 5x higher throughput for Llama 3 70B-based models.
  • This optimization leads to lower operational costs and improved user experiences through decreased latency.
  • The microservices are currently available as hosted application programming interfaces (APIs), allowing for straightforward integration into existing systems.

Real-world applications: Various organizations across industries are already leveraging these new NIM microservices to enhance their AI capabilities.

  • The Tokyo Institute of Technology has fine-tuned Llama-3-Swallow 70B using Japanese-language data, emphasizing the importance of developing sovereign AI models that adhere to cultural norms.
  • Preferred Networks, a Japanese AI company, is using the model to develop a healthcare-specific model trained on Japanese medical data, which has shown top performance on the Japan National Examination for Physicians.
  • Chang Gung Memorial Hospital in Taiwan is building a custom AI Inference Service using Llama 3-Taiwan 70B to improve the efficiency of frontline medical staff and enhance patient care.
  • Taiwanese companies like Pegatron, Chang Chun Group, and Unimicron are adopting Llama 3-Taiwan 70B for various applications in manufacturing, operations, and business processes.

Customization capabilities: NVIDIA AI Foundry provides a platform for enterprises to further customize these regional models for specific business needs.

  • The AI Foundry platform includes popular foundation models, NVIDIA NeMo for fine-tuning, and dedicated capacity on NVIDIA DGX Cloud.
  • This full-stack solution allows developers to create customized foundation models packaged as NIM microservices, ensuring culturally and linguistically appropriate results for their users.
  • The platform also provides access to NVIDIA AI Enterprise software, offering security, stability, and support for production deployments.

Future implications: The launch of these localized AI microservices represents a significant step towards more culturally attuned and efficient AI systems.

  • As more countries invest in sovereign AI infrastructure, these tools will likely play a crucial role in developing AI applications that respect local values and regulations.
  • The ability to fine-tune models for specific industries and use cases could lead to more specialized and effective AI solutions across various sectors.
  • This development may also encourage further innovation in language-specific AI models, potentially leading to more diverse and inclusive AI technologies globally.
NVIDIA Launches NIM Microservices for Generative AI in Japan, Taiwan

Recent News

7 ways to optimize your business for ChatGPT recommendations

Companies must adapt their digital strategy with specific expertise, consistent information across platforms, and authoritative content to appear in AI-powered recommendation results.

Robin Williams’ daughter Zelda slams OpenAI’s Ghibli-style images amid artistic and ethical concerns

Robin Williams' daughter condemns OpenAI's AI-generated Ghibli-style images, highlighting both environmental costs and the contradiction with Miyazaki's well-documented opposition to artificial intelligence in creative work.

AI search tools provide wrong answers up to 60% of the time despite growing adoption

Independent testing reveals AI search tools frequently provide incorrect information, with error rates ranging from 37% to 94% across major platforms despite their growing popularity as Google alternatives.