NVIDIA GTC 2025 Part 1: Keynote and Main Announcements

I attended NVIDIA’s GTC conference in San Jose this year, from March 16th to 21st, and as expected, it was a lot of fun and a very inspiring experience for anyone who is a tech practitioner or is interested in accelerated hardware and CUDA software ecosystem.

At GTC 2025, San Jose McEnery Convention Center

This is Part 1 of a 3-part series covering GTC 2025:

Part 1: Keynote and Main Announcements (this post)
Part 2: Deep Dive into CUDA
Part 3: Exhibit Hall - Hardware and Robotics

Main Highlights

GTC Keynote with Jensen Huang + live Acquired podcast
“CUDA, New Features and Beyond” with Stephen Jones gives overview of CUDA-X libraries + announces CuTile programming paradigm
Multiple deep-dive CUDA sessions on bandwidth, latency, compute optimization and cuda kernel writing
4 hours C++ workshop using the Thrust library
GPU acceleration for Apache Spark, with Project Aether for job migration between cpu and gpu
Exhibit hall showcasing the latest hardware, robotics, and medical equipment

The conference seemed to be more packed than last year, and extended beyond the convention center / Marriott / Hyatt, to the nearby Civic Center and Montgomery Theater for sessions, as well as additional outside tents at the nearby park to showcase exhibitors.

GTC Keynote

Selfie at the keynote with the Acquired podcast on screen

The keynote summarized the trend and direction of accelerated computing, which continues to “eat” cpu-only software applications, and spread to more and more fields, from hard sciences, to wireless communications, to industrial robotics, to enterprise AI.

Training and inference workloads for large language models remains a huge focus for the next generation of high performance accelerated hardware, with the argument that agentic AI will require “10-100x more compute” for inference than traditional one-shot prompts. Data center energy constraints also drive the need for more efficient hardware.

Nvidia sneak-peeked the next generation Exaflops compute hardware, Vera Rubin NVL144 for 2026 and Rubin Ultra NVL576 for 2027. Feynman will be the name of the architecture following Rubin.

NVIDIA AI Infrastructure for Enterprise Computing

Other key announcements included Isaac Groot N1 humanoid model, Newton physics engine, Grace-Blackwell DGX systems, and photonics-based Spectrum-X for scaled clusters.

GTC 2025 keynote highlights

Continue to Part 2: Deep Dive into CUDA to learn about the CUDA workshops, CuTile programming paradigm, and the comprehensive CUDA-X library ecosystem.