Analyzing Modern Nvidia GPU Cores by mfiguiere
[Submitted on 26 Mar 2025]
Abstract:GPUs are the most popular platform for accelerating HPC workloads, such as artificial intelligence and science simulations. However, most microarchitectural research in academia relies on GPU core pipeline designs based on architectures that are more than 15 years old.
This paper reverse engineers modern NVIDIA GPU cores, unveiling many key aspects of its design and explaining how GPUs leverage hardware-compiler techniques w
3 Comments
fafdewfeae
[dead]
winwang
I hope this can help shed the misconception that GPUs are only good at linear algebra and FP arithmetic, which I've been hearing a whole lot!
Edit: learned a bunch, but the "uniform" registers and 64-bit (memory) performance are some easy standouts.
gmays
The special sauce:
> "GPUs leverage hardware-compiler techniques where the compiler guides hardware during execution."