NVIDIA Unlocks AI Compute at Scale, Inviting Capital Partners to Power the AI Infrastructure Buildout
As AI shifts from development to production inference, compute demand is growing and moving to continuously operating AI factories. NVIDIA introduces a new strategy to provide large-scale accelerated computing access to startups and enterprises through a revenue-sharing model, with initial deployments by Sharon AI and Firmus.
As AI moves from model development to production inference, compute demand is accelerating and shifting toward continuously operating AI factories that generate tokens at scale. This shift requires access to large‑scale, multi‑tenant accelerated computing that can come online quickly, stay highly utilized and support the economics of token‑scale AI services.
Emerging AI companies have historically had limited access to capital-intensive infrastructure: even long-term commitments have been insufficient to unlock financing for compute.
To address this, NVIDIA is introducing a new strategy that opens up compute access to the fast‑growing AI ecosystem of startups, model builders, enterprises, research organizations and regional AI players. Under this strategy, AI cloud companies get access to large‑scale NVIDIA infrastructure, with economics aligned through a revenue-sharing and credit-support model.
Through this new model with NVIDIA, AI cloud companies will sell cloud services delivered through NVIDIA DSX AI factories that manufacture tokens at scale. This accelerates the adoption of NVIDIA platforms among customers, while giving these clouds a capital‑efficient path to scale and providing NVIDIA with a new recurring, usage‑linked earnings stream.
For model builders, inference providers, agent platforms and enterprises scaling AI, this can mean faster access to full-stack accelerated computing, without waiting through site selection, power procurement, construction and hardware bring-up.
NVIDIA AI Factory Capacity Built Around Demand
The initiative is already taking shape, with AI cloud companies building DSX AI factories designed to serve customers and workloads across regions.
Sharon AI and Firmus are among the first companies to work with NVIDIA on this strategy.
Sharon AI is deploying up to 40,000 NVIDIA Grace Blackwell GB300 GPUs.
“This strategic collaboration with NVIDIA marks a pivotal moment in Sharon AI’s mission to deliver sovereign, large-scale AI compute infrastructure,” said James Manning, cofounder and CEO of Sharon AI.
Firmus is building a DSX AI factory campus in Batam, Indonesia. The campus is expected to scale to 360 megawatts and up to 170,000 NVIDIA GPUs.
“AI-native companies need access to scalable, energy- and cost-efficient compute infrastructure to compete globally,” said Tim Rosenfield, co-CEO of Firmus Technologies. “Firmus AI cloud is building an NVIDIA DSX-aligned AI factory, which will enable our cloud to help more customers access the compute they need to build and scale AI.”
AI natives such as Baseten, Fireworks AI and Together AI show where compute demand is headed: they need immediate access to AI cloud capacity to run model training, post-training, fine-tuning and high-volume agentic inference for developers, digital natives and enterprises building with AI.
Their customers need reliable access to large-scale NVIDIA accelerated computing as usage grows, but they also need commercial flexibility as products move from pilot to production.
To secure compute capacity and build and deploy AI models, contact Sharon AI and Firmus.
Learn more about NVIDIA Cloud Partners and AI factories.