Paul Graham

Accelerating Python on GPUs

Accelerate your existing NumPy and pandas code with massive speedups on GPUs. Learn about drop-in replacement libraries that require minimal code changes.

Accelerating Python on GPUs
#1about 1 minute

The evolution of GPU programming with Python

Python has become a first-class citizen in the CUDA ecosystem, making it easier to accelerate software on GPUs.

#2about 2 minutes

How GPUs evolved from graphics to AI powerhouses

The development of CUDA unlocked general-purpose GPU computing, which was supercharged by the AlexNet breakthrough in AI.

#3about 2 minutes

Understanding modern GPU architecture for parallelism

A look inside a modern data center GPU reveals thousands of cores and specialized hardware like Tensor Cores designed for massive parallelism.

#4about 2 minutes

Navigating the CUDA Python software ecosystem

The CUDA platform provides a layered stack of libraries, frameworks, and tools to access GPU power at your preferred level of abstraction.

#5about 3 minutes

Using high-level frameworks like Rapids for acceleration

Frameworks like Rapids provide GPU-accelerated versions of tools like pandas and scikit-learn, often requiring zero code changes for massive speedups.

#6about 1 minute

Using CuPy as a drop-in replacement for NumPy

CuPy offers a familiar NumPy-like API that allows you to move array computations to the GPU by simply changing the import statement.

#7about 5 minutes

Optimizing code with nvmath-python and a case study

The nvmath-python library enables kernel fusion for significant speedups, as demonstrated by a supernova detection project that went from 45 minutes to one minute.

#8about 2 minutes

A look at upcoming Python GPU programming tools

New tools like CuTe for array-based programming and Python bindings for CUDA Core Compute Libraries are making GPU development even more accessible.

#9about 2 minutes

Strategies for scaling your code to multiple GPUs

Explore various approaches for multi-GPU programming, from high-level libraries like Dask and JAX to lower-level communication libraries like NCCL and NVSHMEM.

#10about 2 minutes

Profiling and debugging your GPU applications

Use essential developer tools like Nsight Systems and Nsight Compute to profile your application, identify bottlenecks, and optimize performance.

#11about 2 minutes

Resources for getting started with GPU programming

Find examples, labs, and free courses through the NVIDIA Accelerated Compute Hub and Developer Program to begin your GPU programming journey.

Related jobs
Jobs that call for the skills explored in this talk.

job ad

Saby Company
Delebio, Italy

Intermediate

d

Saby Company
Delebio, Italy

Junior

Featured Partners

Related Articles

View all articles
CH
Chris Heilmann
All the videos of Halfstack London 2024!
Last month was Halfstack London, a conference about the web, JavaScript and half a dozen other things. We were there to deliver a talk, but also to record all the sessions and we're happy to share them with you. It took a bit as we had to wait for th...
All the videos of Halfstack London 2024!
TL
Thomas Limbüchler
7 good reasons why you should learn Python in 2021
Python is already more than 30 years old. What started as a hobby nerd project during Christmas in 1989 has become one of the most popular programming languages, according to Stack Overflow and GitHub. Despite its age, the programming language is mor...
7 good reasons why you should learn Python in 2021
LM
Luis Minvielle
The 13 Best Python Libraries for Developers in 2025
Python still stands as one of the three most popular programming languages because it’s incredibly useful for data scraping, data engineering, and data analysis — meaning non-programmers that are handy with numbers, such as accountants or Economics B...
The 13 Best Python Libraries for Developers in 2025

From learning to earning

Jobs that call for the skills explored in this talk.