Companies in search of to harness the facility of AI want custom-made fashions tailor-made to their particular {industry} wants.
NVIDIA AI Foundry is a service that allows enterprises to make use of knowledge, accelerated computing and software program instruments to create and deploy customized fashions that may supercharge their generative AI initiatives.
Simply as TSMC manufactures chips designed by different firms, NVIDIA AI Foundry gives the infrastructure and instruments for different firms to develop and customise AI fashions — utilizing DGX Cloud, basis fashions, NVIDIA NeMo software program, NVIDIA experience, in addition to ecosystem instruments and assist.
The important thing distinction is the product: TSMC produces bodily semiconductor chips, whereas NVIDIA AI Foundry helps create customized fashions. Each allow innovation and hook up with an enormous ecosystem of instruments and companions.
Enterprises can use AI Foundry to customise NVIDIA and open neighborhood fashions, together with the brand new Llama 3.1 assortment, in addition to NVIDIA Nemotron, CodeGemma by Google DeepMind, CodeLlama, Gemma by Google DeepMind, Mistral, Mixtral, Phi-3, StarCoder2 and others.
Trade Pioneers Drive AI Innovation
Trade leaders Amdocs, Capital One, Getty Photos, KT, Hyundai Motor Firm, SAP, ServiceNow and Snowflake are among the many first utilizing NVIDIA AI Foundry. These pioneers are setting the stage for a brand new period of AI-driven innovation in enterprise software program, expertise, communications and media.
“Organizations deploying AI can achieve a aggressive edge with customized fashions that incorporate {industry} and enterprise data,” stated Jeremy Barnes, vp of AI Product at ServiceNow. “ServiceNow is utilizing NVIDIA AI Foundry to fine-tune and deploy fashions that may combine simply inside prospects’ present workflows.”
The Pillars of NVIDIA AI Foundry
NVIDIA AI Foundry is supported by the important thing pillars of basis fashions, enterprise software program, accelerated computing, knowledgeable assist and a broad companion ecosystem.
Its software program consists of AI basis fashions from NVIDIA and the AI neighborhood in addition to the whole NVIDIA NeMo software program platform for fast-tracking mannequin growth.
The computing muscle of NVIDIA AI Foundry is NVIDIA DGX Cloud, a community of accelerated compute sources co-engineered with the world’s main public clouds — Amazon Net Providers, Google Cloud and Oracle Cloud Infrastructure. With DGX Cloud, AI Foundry prospects can develop and fine-tune customized generative AI purposes with unprecedented ease and effectivity, and scale their AI initiatives as wanted with out important upfront investments in {hardware}. This flexibility is essential for companies seeking to keep agile in a quickly altering market.
If an NVIDIA AI Foundry buyer wants help, NVIDIA AI Enterprise specialists are readily available to assist. NVIDIA specialists can stroll prospects by means of every of the steps required to construct, fine-tune and deploy their fashions with proprietary knowledge, guaranteeing the fashions tightly align with their enterprise necessities.
NVIDIA AI Foundry prospects have entry to a world ecosystem of companions that may present a full vary of assist. Accenture, Deloitte, Infosys and Wipro are among the many NVIDIA companions that supply AI Foundry consulting providers that embody design, implementation and administration of AI-driven digital transformation initiatives. Accenture is first to supply its personal AI Foundry-based providing for customized mannequin growth, the Accenture AI Refinery framework.
Moreover, service supply companions akin to Information Monsters, Quantiphi, Slalom and SoftServe assist enterprises navigate the complexities of integrating AI into their present IT landscapes, guaranteeing that AI purposes are scalable, safe and aligned with enterprise goals.
Prospects can develop NVIDIA AI Foundry fashions for manufacturing utilizing AIOps and MLOps platforms from NVIDIA companions, together with Cleanlab, DataDog, Dataiku, Dataloop, DataRobot, Domino Information Lab, Fiddler AI, New Relic, Scale and Weights & Biases.
Prospects can output their AI Foundry fashions as NVIDIA NIM inference microservices — which embrace the customized mannequin, optimized engines and a normal API — to run on their most popular accelerated infrastructure.
Inferencing options like NVIDIA TensorRT-LLM ship improved effectivity for Llama 3.1 fashions to attenuate latency and maximize throughput. This allows enterprises to generate tokens sooner whereas lowering complete price of working the fashions in manufacturing. Enterprise-grade assist and safety is supplied by the NVIDIA AI Enterprise software program suite.

The broad vary of deployment choices consists of NVIDIA-Licensed Techniques from world server manufacturing companions together with Cisco, Dell Applied sciences, Hewlett Packard Enterprise, Lenovo and Supermicro, in addition to cloud cases from Amazon Net Providers, Google Cloud and Oracle Cloud Infrastructure.
Moreover, Collectively AI, a number one AI acceleration cloud, as we speak introduced it would allow its ecosystem of over 100,000 builders and enterprises to make use of its NVIDIA GPU-accelerated inference stack to deploy Llama 3.1 endpoints and different open fashions on DGX Cloud.
“Each enterprise working generative AI purposes desires a sooner consumer expertise, with better effectivity and decrease price,” stated Vipul Ved Prakash, founder and CEO of Collectively AI. “Now, builders and enterprises utilizing the Collectively Inference Engine can maximize efficiency, scalability and safety on NVIDIA DGX Cloud.”
NVIDIA NeMo Speeds and Simplifies Customized Mannequin Growth
With NVIDIA NeMo built-in into AI Foundry, builders have at their fingertips the instruments wanted to curate knowledge, customise basis fashions and consider efficiency. NeMo applied sciences embrace:
- NeMo Curator is a GPU-accelerated data-curation library that improves generative AI mannequin efficiency by getting ready large-scale, high-quality datasets for pretraining and fine-tuning.
- NeMo Customizer is a high-performance, scalable microservice that simplifies fine-tuning and alignment of LLMs for domain-specific use instances.
- NeMo Evaluator gives automated evaluation of generative AI fashions throughout tutorial and customized benchmarks on any accelerated cloud or knowledge heart.
- NeMo Guardrails orchestrates dialog administration, supporting accuracy, appropriateness and safety in sensible purposes with giant language fashions to offer safeguards for generative AI purposes.
Utilizing the NeMo platform in NVIDIA AI Foundry, companies can create customized AI fashions which are exactly tailor-made to their wants. This customization permits for higher alignment with strategic goals, improved accuracy in decision-making and enhanced operational effectivity. As an example, firms can develop fashions that perceive industry-specific jargon, adjust to regulatory necessities and combine seamlessly with present workflows.
“As a subsequent step of our partnership, SAP plans to make use of NVIDIA’s NeMo platform to assist companies to speed up AI-driven productiveness powered by SAP Enterprise AI,” stated Philipp Herzig, chief AI officer at SAP.
Enterprises can deploy their customized AI fashions in manufacturing with NVIDIA NeMo Retriever NIM inference microservices. These assist builders fetch proprietary knowledge to generate educated responses for his or her AI purposes with retrieval-augmented technology (RAG).
“Secure, reliable AI is a non-negotiable for enterprises harnessing generative AI, with retrieval accuracy instantly impacting the relevance and high quality of generated responses in RAG programs,” stated Baris Gultekin, Head of AI, Snowflake. “Snowflake Cortex AI leverages NeMo Retriever, a part of NVIDIA AI Foundry, to additional present enterprises with simple, environment friendly, and trusted solutions utilizing their customized knowledge.”
Customized Fashions Drive Aggressive Benefit
One of many key benefits of NVIDIA AI Foundry is its potential to handle the distinctive challenges confronted by enterprises in adopting AI. Generic AI fashions can fall in need of assembly particular enterprise wants and knowledge safety necessities. Customized AI fashions, alternatively, supply superior flexibility, adaptability and efficiency, making them preferrred for enterprises in search of to realize a aggressive edge.
Be taught extra about how NVIDIA AI Foundry permits enterprises to spice up productiveness and innovation.