Summary and Schedule
This is the GPU programming lesson.
Do you want to teach this lesson?
This material is open source and freely available. Are you planning to use it in your teaching? Send us an email at training@esciencecenter.nl. We would love to help you prepare to teach the lesson, and to receive feedback on how it could be further improved based on your experience in the workshop.
| Duration | Episode | Key questions |
| --- | --- | --- |
| | Setup Instructions | Download files required for the lesson |
| 00h 00m | 1. Introduction | What is a Graphics Processing Unit? Can a GPU be used for anything other than graphics? For which kinds of tasks are GPUs faster than CPUs? |
| 00h 15m | 2. Using your GPU with CuPy | How can I run NumPy code on the GPU? How can I copy NumPy arrays to the GPU? How do I reliably measure the execution time of GPU code? |
| 04h 45m | 3. Accelerate your Python code with Numba | How can I run my own Python functions on the GPU? |
| 05h 45m | 4. A Better Look at the GPU | How does a GPU work? |
| 06h 05m | 5. Your First GPU Kernel | How do I identify data parallelism in my code? How do I write a GPU program? What is CUDA? How are CUDA threads organised into blocks and grids? |
| 07h 15m | 6. Registers, Global, and Local Memory | What are registers? How do I share data between host and GPU? What are the differences between the memory spaces available in CUDA? What happens when a thread uses more variables than there are available registers? |
| 08h 00m | 7. Shared Memory and Synchronization | Is there a way to share data between threads of the same block? Can threads inside a block wait for other threads? |
| 08h 55m | 8. Constant Memory | Is there a way to have a read-only cache in CUDA? |
| 09h 35m | 9. Concurrent access to the GPU | Is it possible to concurrently execute more than one kernel on a single GPU? |
| 10h 15m | Finish | |
The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.
Programming environment
The GPU programming lesson can be taught using Jupyter Notebook, a programming environment that runs in a web browser. For this to work we need a reasonably up-to-date browser. The current versions of the Chrome, Safari and Firefox browsers are all supported.
In case you do not have any GPU available on your laptop, a good alternative is to use Google Colab.
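If you are unsure whether your laptop has a usable NVIDIA GPU, a quick heuristic is to look for the `nvidia-smi` tool, which ships with the NVIDIA driver. This is only a sketch, not a definitive check of a working CUDA setup:

```python
import shutil

# nvidia-smi is installed together with the NVIDIA driver; if it is
# on the PATH, an NVIDIA GPU driver is most likely present.
if shutil.which("nvidia-smi") is None:
    print("No NVIDIA driver tools found; consider using Google Colab.")
else:
    print("NVIDIA driver tools found; a local setup should work.")
```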
Local setup
To set up locally, there are two alternatives, depending on how you installed Python:

- use `pip` if you installed Python normally, through your OS's package manager or app store;
- use `conda` or `mamba` if you installed the conda distribution of Python.

If you don't have Python installed, we recommend starting with Miniforge. Miniforge sets conda-forge as the default channel and provides the alternative package manager `mamba`, which is considerably faster than `conda` and makes the user experience significantly smoother.
Whichever alternative applies to you, the first step is to create an isolated environment for the workshop, so that you do not interfere with your existing setup. You can then install all the dependencies for the workshop within this environment. In the Python ecosystem, such isolated environments are known as virtual environments.
Using pip
To create a virtual environment using `pip`, you need to install the `virtualenv` package using your OS's package manager (it may have an alternate name such as `python-virtualenv` or `python3-virtualenv`). After you have done this, you can follow the steps below:
```bash
cd /path/to/workshop/dir
python3 -m virtualenv --prompt gpu-workshop venv
source venv/bin/activate
pip install -U pip  # update pip to the latest version
pip install cupy-cuda12x numba jupyterlab matplotlib scipy astropy
```
We are installing the precompiled CuPy wheel built against CUDA 12. This is always faster to install, but if you want to use a custom CUDA installation you can run `pip install cupy` instead. Also note that if you want the CUDA compiler `nvcc`, you have to install the CUDA Toolkit manually; however, this is not required to follow the workshop. More information can be found in the CuPy documentation.
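Once the installation has finished, a quick sanity check is to run a tiny computation with CuPy. The snippet below is only a sketch: the broad `except` lets it degrade gracefully on machines where CuPy or a usable GPU is unavailable.

```python
try:
    import cupy as cp

    squares = cp.arange(5) ** 2            # computed on the GPU
    result = cp.asnumpy(squares).tolist()  # copy the result back to the host
    print("CuPy works:", result)
except Exception as err:                   # no CuPy, or no usable GPU
    result = None
    print("CuPy check failed:", err)
```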
Using conda or mamba
`conda` and `mamba` have built-in support for virtual environments. You can create and populate a new virtual environment with:

```bash
mamba create -n gpu-workshop python=3.11
mamba activate gpu-workshop
mamba install cupy numba jupyterlab matplotlib scipy astropy
```

If you are using `conda`, simply replace `mamba` with `conda` in the commands above.
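Whether you used pip or conda/mamba, you can check from inside the activated environment that all the workshop dependencies are importable. This is a small stdlib-only sketch; the names below are the import names of the packages installed above (note that `cupy-cuda12x` is imported as `cupy`).

```python
import importlib.util

# Import names of the packages installed in the steps above.
required = ["cupy", "numba", "jupyterlab", "matplotlib", "scipy", "astropy"]
missing = [name for name in required if importlib.util.find_spec(name) is None]
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All workshop dependencies found.")
```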
Starting a Jupyter server
Now you can start your Jupyter server with the command `jupyter lab`, which will open a tab with Jupyter in your default browser. If you do not want Jupyter to open a browser tab automatically, use `jupyter lab --no-browser` instead: this will print a URL in your terminal, which you can then open in the browser of your choice.