NVIDIA GTC 2025 Part 1: Keynote and Main Announcements
I attended NVIDIA’s GTC conference in San Jose this year, from March 16th to 21st, and as expected, it was a lot of fun and a very inspiring experience for anyone who is a tech practitioner or is interested in accelerated hardware and CUDA software ecosystem.

This is Part 1 of a 3-part series covering GTC 2025:
- Part 1: Keynote and Main Announcements (this post)
- Part 2: Deep Dive into CUDA
- Part 3: Exhibit Hall - Hardware and Robotics
Main Highlights
- GTC Keynote with Jensen Huang + live Acquired podcast
- “CUDA, New Features and Beyond” with Stephen Jones gives overview of CUDA-X libraries + announces CuTile programming paradigm
- Multiple deep-dive CUDA sessions on bandwidth, latency, compute optimization and cuda kernel writing
- 4 hours C++ workshop using the Thrust library
- GPU acceleration for Apache Spark, with Project Aether for job migration between cpu and gpu
- Exhibit hall showcasing the latest hardware, robotics, and medical equipment
The conference seemed to be more packed than last year, and extended beyond the convention center / Marriott / Hyatt, to the nearby Civic Center and Montgomery Theater for sessions, as well as additional outside tents at the nearby park to showcase exhibitors.
GTC Keynote
The keynote summarized the trend and direction of accelerated computing, which continues to “eat” cpu-only software applications, and spread to more and more fields, from hard sciences, to wireless communications, to industrial robotics, to enterprise AI.
Training and inference workloads for large language models remains a huge focus for the next generation of high performance accelerated hardware, with the argument that agentic AI will require “10-100x more compute” for inference than traditional one-shot prompts. Data center energy constraints also drive the need for more efficient hardware.
Nvidia sneak-peeked the next generation Exaflops compute hardware, Vera Rubin NVL144 for 2026 and Rubin Ultra NVL576 for 2027. Feynman will be the name of the architecture following Rubin.
Other key announcements included Isaac Groot N1 humanoid model, Newton physics engine, Grace-Blackwell DGX systems, and photonics-based Spectrum-X for scaled clusters.

Continue to Part 2: Deep Dive into CUDA to learn about the CUDA workshops, CuTile programming paradigm, and the comprehensive CUDA-X library ecosystem.