NVIDIA and AWS collaborate to provide scalable, low-latency AI infrastructure with new EC2 G7 instances featuring Blackwell GPUs, GPU-accelerated vector indexing in OpenSearch Serverless powered by cuVS, and AWS achieving NVIDIA Exemplar Cloud status for GB300 training.
EC2 G7 instances with NVIDIA RTX PRO 4500 Blackwell GPUs deliver up to 4.6x AI inference performance.
OpenSearch Serverless defaults to GPU-accelerated vector indexing, achieving 10x faster indexing at a quarter of the cost.
Companies are moving from experimenting with frontier models to building specialized AI agents tailored to their workflows. The NVIDIA Agent Toolkit provides open, modular building blocks—models, tools, skills, and a secure runtime—to create customizable and trustworthy digital coworkers.
The second wave of enterprise AI focuses on specialized agents for complex workflows.
NVIDIA Agent Toolkit includes Nemotron models, NemoClaw blueprints, and OpenShell runtime.
NVIDIA technology runs 81% of the TOP500 and 90% of the systems new to the list. 26 systems on the TOP500 adopted the NVIDIA Grace CPU, up eight from the previous list. The top eight systems on the Green500 run on NVIDIA GPUs and nine of the top 10 use NVIDIA technologies. No. 1 on the Green500, KAIROS, uses a single NVIDIA Grace Hopper Superchip. 376 of the TOP500 systems are interconnected using NVIDIA networking. A record 35 NVIDIA AI HPC supercomputers are in development across Europe.
NVIDIA powers 81% of TOP500 supercomputers and 90% of new entries.
Grace CPU adoption grows to 26 systems, up 8 from prior list.
Telecom operators have seen remarkable returns from generative AI in network management, customer care, and back-office automation. The industry is now moving toward autonomous networks where AI agents proactively monitor and coordinate changes. At TM Forum’s DTW Ignite 2026, NVIDIA and partners showcase building blocks including synthetic data, domain models, secure runtimes, and simulation, enabling more resilient networks and richer AI-driven services.
NVIDIA and partners present a telecom autonomy platform with synthetic data, telecom-domain models, secure agent runtimes, and simulation.
SoftBank, Amdocs, NTT DATA, and others pilot long-running autonomous agents for self-healing networks, proactive customer care, and anomaly detection.
JUPITER, Europe’s first exascale supercomputer at Germany’s Forschungszentrum Jülich, runs on NVIDIA Grace Hopper Superchips and NVIDIA Quantum-X800 InfiniBand networking — and it’s had a busy year. Four projects running on JUPITER demonstrate what exascale computing can actually do: map the human brain at cellular scale, simulate the entire Earth’s climate at 1-kilometer resolution, build AI systems for next-generation wireless networks, and simulate a universal 50-qubit quantum computer.
JUPITER is Europe's first exascale supercomputer, powered by NVIDIA Grace Hopper and Quantum-X800 InfiniBand.
Four frontier scientific projects on JUPITER span brain mapping, climate simulation, 6G AI, and quantum computing.
The U.S. National Science Foundation's NAIRR pilot program has driven over 700 projects in two years, with NVIDIA providing DGX nodes and technical support. Highlights include Polymathic AI's fluid simulations, University of Michigan's fusion model for energy storage, and Boston University's BEACON pipeline for infectious disease detection.
NVIDIA contributes DGX nodes and technical support to NAIRR, accelerating research
Polymathic AI develops Walrus foundation model for fluid simulations
At ISC Hamburg, NVIDIA introduced DAQIRI, ALCHEMI NIM, and cuPhoton software to accelerate science. cuPhoton achieved up to 14,900x speedup in astronomical data processing, while DAQIRI enables real-time AI on high-speed data streams. ALCHEMI accelerates materials simulation, with Lila Sciences using it to speed up materials screening by 50x.
NVIDIA introduces new software at ISC: DAQIRI, ALCHEMI NIM, and cuPhoton to accelerate AI for science.
cuPhoton speeds FITS data processing by up to 14,900x for loading/reading, enabling faster dark matter and dark energy research.
Los Alamos National Laboratory (LANL) is building three new supercomputers with HPE and NVIDIA, powered by NVIDIA Vera CPUs, to accelerate scientific discovery and enable agentic AI for science. The systems—Mission, Vision, and Veritas—use the HPE Cray GX5000 architecture with the NVIDIA Vera Rubin platform. Early tests show Vera CPUs delivering 7x higher performance on URSA workloads and over 3x on heat transfer simulations compared to Crossroads x86 CPUs. Mission, expected in 2027, will replace Crossroads for classified national security work, while Vision will support fundamental science.
LANL to deploy three new supercomputers: Mission, Vision, and Veritas, all using NVIDIA Vera CPUs.
Vera CPU delivers 7x performance boost on URSA workloads and over 3x on Branson simulations.
AI's growth is energy-hungry. Eco Wave Power uses NVIDIA AI and digital twins to convert ocean wave energy into clean electricity, leveraging existing coastal infrastructure. Wave energy is abundant, less intermittent, and can potentially power AI factories and data centers.
Eco Wave Power uses NVIDIA AI and digital twins to harness wave energy via existing marine infrastructure.
Wave energy could provide over 60% of U.S. annual electricity consumption, and is more reliable than solar or wind.
NVIDIA's new Rubin generation AI servers achieve 100% liquid cooling with coolant temperatures up to 45°C, hotter than a hot tub. This design significantly improves energy efficiency by reducing cooling energy consumption and water usage. In favorable climates, chiller-less operation is possible, nearly eliminating water consumption. Traditional data centers allocate up to 40% of electricity to cooling, but liquid cooling can slash costs.
NVIDIA Rubin AI servers are the first to achieve 100% liquid cooling, with coolant up to 45°C.
Liquid cooling drastically reduces cooling energy use, saving over $4 million annually in a 50 MW hyperscale facility.
In a consequential grid infrastructure decision, the Federal Energy Regulatory Commission (FERC) today issued a major milestone on large-load interconnection impacting how those building AI factories, semiconductor fabrication support systems and advanced manufacturing facilities can connect to the grid. The new framework streamlines the interconnection queue, allowing large customers to fund upgrades, bring new generation, and offer flexible load, with study periods as short as 60 days. Data shows that increasing consumption can lower retail prices. This policy promotes growth, affordability, and reliability.
FERC's new rules allow large customers to self-fund grid upgrades, reducing cost pressure on existing ratepayers
Flexible load customers can accelerate interconnection with study periods as short as 60 days
The digital era gave the advertising and marketing industry speed; the AI era is giving it autonomous operations. At Cannes Lions, companies including Alembic, AWS, Criteo, Higgsfield, KERV.ai, and Taboola showcase how NVIDIA technologies enable faster, autonomous operations at enterprise scale.
Causal AI platform Alembic uses NVIDIA DGX Vera Rubin systems to scale enterprise causal modeling, proving actual growth drivers.
AWS integrates NVIDIA GPU acceleration with cloud infrastructure for real-time AI-powered bidding in ad auctions.
A year after announcing ambitious AI plans at NVIDIA GTC Paris, France’s AI infrastructure is coming online: AI agents are in production, startups are deploying applications, and the ecosystem is developing models and platforms tailored to local languages and European requirements. Key developments include Mistral’s new data center, open model collaborations via NVIDIA Nemotron, and enterprise AI adoption across healthcare, telecom, automotive, energy, and cosmetics industries.
France’s AI infrastructure is taking shape with Mistral’s 44MW data center and plans for 200MW by 2027. Other investments include Scaleway’s Blackwell instances, Bull/Foxconn production, and a bid for a European AI gigafactory.
Open models are central: NVIDIA Nemotron Coalition partners like Mistral, LINAGORA, H Company, and Pleias are developing models for local languages and EU compliance.
NVIDIA XR AI is now available in public beta, giving developers a framework for building multimodal AI agents for AR glasses and XR devices. The platform integrates core capabilities for ingesting device signals, connecting to enterprise tools, supporting diverse AI models, and orchestrating agents, with applications already emerging in manufacturing, science, healthcare, design, and immersive learning.
NVIDIA XR AI enables developers to build spatially aware, multimodal AI agents for AR glasses and XR devices.
The platform includes four core capabilities: real-world signal ingestion, tool and service connections, broad AI model support, and agent orchestration with accelerated runtime.
Coherent broke ground today on an expanded manufacturing building in Sherman, Texas, to scale production of indium phosphide wafers and optical components essential for AI infrastructure. NVIDIA CEO Jensen Huang attended, highlighting the strategic partnership. A $50 million CHIPS Act grant supports the project, which is expected to create over 550 jobs and strengthen domestic semiconductor manufacturing.
Coherent broke ground on an expanded manufacturing facility in Sherman, Texas, to boost optical component production for AI.
The project is backed by a $50 million CHIPS Act grant and will create over 550 jobs.
Enterprises are moving agentic AI from proof of concept to production — and the next generation of AI factories are built for the era of agents. At HPE Discover Las Vegas, NVIDIA and HPE are expanding the HPE AI Factory with NVIDIA, including NVIDIA Vera CPU and NVIDIA Agent Toolkit for HPE Private Cloud AI. NVIDIA Confidential Computing extends across HPE AI Factory and enhanced full-stack NVIDIA integration is available throughout the portfolio.
NVIDIA Vera CPU, built for agents, will be available with HPE Private Cloud AI in 2027.
NVIDIA Agent Toolkit now available with HPE Private Cloud AI, providing an agentic AI operating system.
NVIDIA Blackwell platform achieves fastest training times across all benchmarks in MLPerf Training 6.0, demonstrates large-scale training with up to 8,192 GPUs, and highlights reliability features.
NVIDIA Blackwell is the only platform to submit results across all seven MLPerf Training 6.0 benchmarks, achieving the fastest time for each.
GB300 NVL72 delivers up to 1.6x faster training than GB200 NVL72 at the same scale.
Artificial Analysis released AgentPerf, the industry's first benchmark for agentic AI. Initial results show NVIDIA Blackwell Ultra NVL72 leading, running 20x more agents per megawatt than Hopper. The benchmark measures how many concurrent agentic tasks a platform can support under real-world coding agent workloads.
AgentPerf is the first benchmark designed for agentic AI workloads, focusing on chained LLM calls and tool calls.
NVIDIA GB300 NVL72 delivers 20x more agents per megawatt than H200 on DeepSeek V4 Pro model.
NVIDIA's GeForce NOW summer sale offers up to $70 off a 12-month Ultimate membership and $35 off a Performance membership. The cloud gaming service eliminates hardware barriers, provides instant access to high-performance RTX gaming across devices, and announces Guild Wars 3 coming to the platform with exclusive rewards for current Guild Wars titles.
GeForce NOW summer sale: $70 off Ultimate and $35 off Performance annual memberships for a limited time.
Cloud gaming removes hardware constraints, offering instant game access, automatic updates, and cross-device play.
As robotaxi services expand globally, NVIDIA introduces Halos OS—a comprehensive safety system integrating certified OS, standardized interfaces, AI guardrails, and a validation framework to ensure safety is built into autonomous vehicles from the ground up.
Multiple robotaxi programs are launching worldwide using NVIDIA DRIVE Hyperion, including Uber/Autobrains in Munich, Foxconn in Taiwan, VinFast in Southeast Asia, and HUMAIN in Saudi Arabia.
NVIDIA Halos OS addresses four key safety challenges: a safety-certifiable operating system, safe interfaces, AI with verifiable guardrails, and validation at scale.
Google DeepMind released DiffusionGemma, an experimental open model for fast text generation using parallel token generation. NVIDIA optimized it to run faster on GeForce RTX, RTX PRO, and DGX Spark systems, achieving up to 1000 tokens/sec locally.
DiffusionGemma generates up to 256 tokens in parallel per step, unlike traditional autoregressive models. Based on Gemma 4 (26B parameters, MoE), activating only 3.8B per step. Up to 4x faster performance. Open source under Apache 2.0, runs locally with no cloud dependency.
NVIDIA GPUs with Confidential Computing are now used for confidential inference in Apple's Private Cloud Compute (PCC), which is expanding from Apple's own data centers to Google Cloud. The technology provides a hardware-based security layer to protect user data during processing.
NVIDIA Confidential Computing GPUs now used in Apple Private Cloud Compute
One year after NVIDIA CEO Jensen Huang and UK PM Keir Starmer declared the UK would be an AI maker, NVIDIA and partners showcase progress at London Tech Week. Key developments include doubling of sovereign AI cloud providers, Isambard-AI supercomputer, Sovereign AI Fund backing startups, and enterprise AI deployments across sectors.
Number of AI cloud providers planning UK deployments has doubled; Nebius, CoreWeave, BT and Nscale announce new infrastructure.
Isambard-AI, the UK's most powerful computer with 5,400 NVIDIA GH200 superchips, powers ambitious AI research.
NVIDIA and LG Group are building an AI factory to accelerate LG's AI-driven businesses in robotics, autonomous driving, data center technologies, and GPU cloud services. The collaboration integrates NVIDIA's full-stack AI factory platform with LG's leadership in consumer electronics and robotics, aiming to create a unified workflow for physical AI systems.
NVIDIA and LG collaborate on an AI factory covering robotics, autonomous driving, data centers, and GPU cloud.
LG Electronics will use NVIDIA Isaac Sim and Isaac Lab for home robots, and explore the GR00T model.
NVIDIA and Doosan Group are expanding their collaboration to advance new opportunities across physical AI, robotics and AI factory infrastructure, spanning Doosan Robotics, Doosan Bobcat, Doosan Enerbility and Doosan Corporation Electro-Materials BG.
Doosan Robotics integrates NVIDIA Isaac Sim and other platforms to advance Agentic Robot OS.
Doosan Bobcat plans to use NVIDIA physical AI for autonomous equipment.
At GTC Taipei at COMPUTEX last week, NVIDIA unveiled RTX Spark, the superchip that reinvents Windows PCs for the era of personal AI agents. On the heels of this announcement, NVIDIA founder and CEO Jensen Huang headed to South Korea, where he introduced RTX Spark to the nation’s passionate gaming community. Leading game developers — including Korea’s KRAFTON and NC — are already working to bring their titles to RTX Spark-powered systems.
NVIDIA unveiled RTX Spark superchip for AI, creation, and gaming, supporting AAA games at 1440p over 100 fps.
Jensen Huang met with T1’s LoL world champions including Faker, and showcased RTX Spark at T1 Base Camp.
NVIDIA CEO Jensen Huang visits Seoul this week to meet partners and builders behind South Korea's AI ecosystem, focusing on AI supply chain, robotics, and physical AI opportunities.
Huang visits Seoul to align the AI supply chain ahead of a busy second half of the year.
Highlights progress on Grace Blackwell and Vera Rubin systems; urges Korea to invest in AI.
At CVPR, NVIDIA Research presents three papers tackling key challenges in robotic grasping, autonomous driving reasoning, and virtual agent training. GraspGen-X is the first foundation model for zero-shot grasping, adaptable to any gripper. LCDrive accelerates vehicle reasoning using compact latent representations. NitroGen, based on Isaac GR00T, trains embodied agents in virtual environments over thousands of hours. The work emphasizes the importance of large-scale training for generalization.
GraspGen-X is the first foundation model for zero-shot grasping, trained on 2 billion simulated grasps to work with any gripper.
LCDrive replaces text-based reasoning with latent representations, achieving comparable trajectory quality with roughly half the tokens on embedded hardware.
At CVPR 2026, NVIDIA introduced new physical AI agent skills to accelerate development of autonomous vehicles, robots, and vision AI systems. These skills integrate with NVIDIA Cosmos 3, simulation frameworks, and libraries to automate workflows from scene reconstruction to policy training. Key advancements include Neural Reconstruction, AlpaGym, OmniDreams, Metropolis skills for defect generation, and new robotics tools in Isaac Sim. NVIDIA also released open models like Alpamayo 2 Super and datasets like GRAIL.
NVIDIA unveiled physical AI agent skills at CVPR to streamline end-to-end workflows for AV, robotics, and vision AI.
New tools include Neural Reconstruction, AlpaGym, OmniDreams, and Metropolis skills for synthetic data and scenario generation.
Accelerated computing has revolutionized industrial engineering, compressing simulation times from weeks to hours. Today’s remaining challenges sit in the end-to-end workflow surrounding the simulations: computer-aided design, meshing, simulation setup and debugging, as well as post-processing and generating summary reports of these processes. At GTC Taipei at COMPUTEX, NVIDIA and more than a dozen engineering software providers are showcasing how autonomous AI agents automate this entire workflow. These AI engineers are based on NVIDIA NemoClaw, an open blueprint for building specialized, long-running agents with a secure runtime and frontier models.
NVIDIA announces NemoClaw, an open blueprint for building secure, long-running autonomous AI engineers.
Cadence, Dassault Systèmes, Siemens, and Synopsys are integrating NemoClaw to automate design, simulation, and verification workflows.