Lecture 20

Learning objectives

After this class, you should be able to:

  1. Explain stream scheduling in different generations of Nvidia GPUs.
  2. Use this knowledge to improve performance when (i) overlapping computation with data movement and (ii) executing kernels concurrently.

Reading assignment

  1. UIUC Lecture 21b.
  2. Read the Nvidia webinar: StreamsAndConcurrencyWebinar.pdf.
  3. Look up online resources to learn about timing using CUDA Events.

Exercises and review questions


Last modified: 26 Mar 2013