Lecture 7

Learning objectives

After this class, you should be able to:

  1. Describe the following CUDA features: cudaStreamCreate, cudaStreamDestroy, cudaMemcpyAsync, and calling kernels with a stream parameter.
  2. Use the above features to hide data transfer overhead through multiple bufferring.
  3. Write code that uses memory coalescing to reduce the data transfer overhead in accessing DRAM.

Reading assignment

  1. Read the Lecture 7, UIUC Lecture 6-7 slides (until slide 15), UIUC Lecture 7 slides.
  2. Chapter 6 of text.

Exercises and review questions


Last modified: 29 Jan 2013