Microsoft Azure Gets New Nvidia Service For Developing Custom GenAI Apps
Nvidia AI Foundry gives businesses a collection of ‘commercially viable’ generative AI foundation models, developed by Nvidia and third parties, that they can customize using Nvidia’s NeMo framework and tools on the company’s DGX Cloud AI supercomputing service hosted by Microsoft Azure.
Microsoft Azure is getting a new service from Nvidia that is designed to let businesses develop, fine-tune and run custom generative AI applications using proprietary data.
Announced at Microsoft Ignite 2023 on Wednesday, the new service is called Nvidia AI Foundry, and it gives businesses a collection of “commercially viable” GenAI foundation models, developed by Nvidia and third parties, that they can customize using Nvidia’s NeMo framework and tools.
[Related: Nvidia ‘Doubling Down’ On Partners With DGX Cloud Service]
All of this runs on Nvidia’s DGX Cloud AI supercomputing service, which is hosted on Nvidia GPU-powered Azure instances and comes with all the software elements needed to develop and deploy AI applications in the cloud, including the Nvidia AI Enterprise software suite.
Among the first companies to use Nvidia AI Foundry are SAP, Amdocs and Getty Images.
“Any customer of Microsoft can come and do this entire enterprise generative AI workflow with Nvidia on Azure,” Manuvir Das, vice president of enterprise computing at Nvidia, said in a briefing.
The launch of Nvidia AI Foundry on Microsoft Azure coincides with the arrival of the cloud computing giant’s NC H100 v5 VM series cloud instances, which are the first in the industry to take advantage of Nvidia’s H100 NVL GPUs. The H100 NVL GPUs combine two PCIe-based H100 GPUs to provide nearly 4 petaflops of AI compute and 188GB of HBM3 high-bandwidth memory.
Microsoft also announced that it plans to launch cloud instances next year that will use Nvidia’s recently disclosed H200 GPU, which offers 141GB of HBM3e memory and 4.8 TB/s of memory bandwidth, a big increase from the H100’s 80GB of HBM3 and 3.35 TB/s.
How Nvidia AI Foundry Works
Nvidia AI Foundry is part of the AI chip giant’s bid to become what CEO Jensen Huang describes as a “full-stack computing company.” This translates into the chip designer providing the essential hardware and software for powering advanced computers, from GPUs, CPUs and networking components, to systems and reference designs, to various layers of software.
At the foundation of Nvidia AI Foundry is the DGX Cloud service. Announced earlier this year, the service combines Nvidia’s Base Command software platform and the Nvidia AI Enterprise software suite with the company’s DGX supercomputer systems, which are powered by either its flagship H100 data center GPUs or the previous-generation A100 GPUs.
Base Command manages and monitors AI training workloads, and it also lets users right-size the infrastructure they need for such workloads. AI Enterprise, on the other hand, comes with the AI frameworks and tools needed for developing and deploying AI applications.
“Importantly, customers of DGX Cloud have the capability to work directly with Nvidia engineering to optimize their workloads when they come and work on DGX Cloud,” Das said.
To let businesses develop custom GenAI applications, Nvidia is offering two additional components as part of Nvidia AI Foundry. The first is a collection of what the company calls Nvidia AI Foundation models, which consists of large language models developed by Nvidia that businesses can customize.
The AI Foundation collection includes Nvidia’s new Nemotron-3 8B models, which include versions tuned for different use cases, such as chatbots, and offer multilingual capabilities.
“It is a mission for Nvidia on behalf of enterprise customers now to continually produce and update these variants of these models, both in terms of parameter size, in terms of the datasets they’re trained on, in terms of the capabilities, and all with responsibly sourced data that we are able to share with our enterprise customers so that they know the antecedents of the model,” Das said.
Nvidia is also making available third-party AI models such as Meta’s Llama 2 that have been optimized to run on the company’s hardware and software.
To customize these models, businesses can use Nvidia’s NeMo framework and tools, which include methods for curating proprietary data sets for use in models, fine-tuning those models and applying guardrails to ensure their proper and safe use.
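Nvidia ships the guardrails piece as the open-source NeMo Guardrails toolkit, which gives a feel for how that layer of the workflow is wired up. Below is a minimal sketch; the “config” directory and the sample prompt are illustrative assumptions, not details from Nvidia’s announcement:

```python
# pip install nemoguardrails
from nemoguardrails import LLMRails, RailsConfig

# Load a guardrails configuration from a local directory. "config/" is a
# hypothetical path holding a config.yml (model settings) and Colang files
# that define the conversational rails.
config = RailsConfig.from_path("config")
rails = LLMRails(config)

# Generate a reply that is checked against the configured input/output rails,
# so off-topic or unsafe requests can be deflected before reaching the model.
reply = rails.generate(messages=[
    {"role": "user", "content": "Summarize our internal support tickets."}
])
print(reply["content"])
```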
DGX Cloud and AI Enterprise are available on Azure Marketplace. AI Enterprise, which includes NeMo, is also integrated into Microsoft’s Azure Machine Learning service. The models developed by Nvidia and third parties are available in the Azure AI Model catalog.
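For developers, that catalog integration means the models can be browsed and pulled through the Azure Machine Learning Python SDK. A minimal sketch follows, assuming the azure-ai-ml package and the shared “azureml” registry name behind the model catalog (the registry name is an assumption, not something confirmed in the announcement):

```python
# pip install azure-ai-ml azure-identity
from azure.ai.ml import MLClient
from azure.identity import DefaultAzureCredential

# Connect to a shared model registry rather than a single workspace;
# "azureml" is assumed here to be the registry backing the model catalog.
registry = MLClient(credential=DefaultAzureCredential(), registry_name="azureml")

# List the models published in the registry; any one of them can then be
# fetched with registry.models.get(name=..., version=...) for deployment
# or fine-tuning in an Azure ML workspace.
for model in registry.models.list():
    print(model.name)
```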