Beta

GPU Programming

Instructor Notes

This is a placeholder file. Please add content here.

IntroductionGraphics Processing Unit Parallel by DesignSpeed Benefits

Using your GPU with CuPyIntroduction to CuPy Convolutions in PythonA scientific application: image processing for radio astronomy

Accelerate your Python code with NumbaUsing Numba to execute Python code on the GPU

A Better Look at the GPUThe GPU, a High Level View at the Hardware How Programs are ExecutedDifferent MemoriesAdditional Material

Your First GPU KernelSumming Two Vectors in Python Summing Two Vectors in CUDARunning Code on the GPU with CuPyUnderstanding the CUDA CodeComputing Hierarchy in CUDAVectors of Arbitrary Size

Registers, Global, and Local MemoryRegisters Global MemoryLocal Memory

Shared Memory and SynchronizationShared Memory Thread Synchronization

Constant MemoryConstant Memory

Concurrent access to the GPUConcurrently execute two kernels on the same GPU Stream synchronizationMeasure execution time using streams and events