CUDA course (Compute Unified Device Architecture)

Title: Fundamentals of Accelerated Computation Using CUDA C/C++

// Oxford course link https://people.maths.ox.ac.uk/~gilesm/cuda/

// Labs: https://iis-people.ee.ethz.ch/~gmichi/asocd_2014/exercises/ex_03.pdf (lectures come from the course link above)

// NVIDIA official CUDA C programming guide https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html (HTML) https://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf (PDF)

// Programming model https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#programming-model

// Parallel programming with CUDA https://www.davidmuench.de/studienarbeit.pdf

// Introduction

https://people.maths.ox.ac.uk/~gilesm/cuda/2019/lecture_01.pdf
tasks, trivial vector addition example
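
A minimal sketch of the trivial vector addition task, not the lecture's own code. It assumes unified memory (`cudaMallocManaged`) to keep the host side short; the kernel name `vector_add` and the sizes are illustrative, and error checking is omitted.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread adds one pair of elements.
__global__ void vector_add(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];   // guard: the last block may be partly idle
}

int main() {
    const int n = 1 << 20;
    const size_t bytes = n * sizeof(float);

    // Assumption: managed memory, so host and device share the same pointers.
    float *a, *b, *c;
    cudaMallocManaged(&a, bytes);
    cudaMallocManaged(&b, bytes);
    cudaMallocManaged(&c, bytes);
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;  // round up
    vector_add<<<blocks, threads>>>(a, b, c, n);
    cudaDeviceSynchronize();  // wait for the kernel before reading c on the host

    int errors = 0;
    for (int i = 0; i < n; ++i) if (c[i] != 3.0f) ++errors;
    printf("errors: %d\n", errors);

    cudaFree(a); cudaFree(b); cudaFree(c);
    return errors != 0;
}
```

Build with `nvcc vector_add.cu -o vector_add` (the file name is an assumption).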

// Different memory and variable types - basic kernel implementation

https://people.maths.ox.ac.uk/~gilesm/cuda/2019/lecture_02.pdf
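
A small sketch that touches several of the memory and variable types in one kernel: a `__constant__` variable, a `__shared__` array, and ordinary local variables (normally held in registers). The kernel `reverse_scale` and the block size are hypothetical and not taken from the lecture; error checking is omitted.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define BLOCK 256

__constant__ float scale;  // constant memory: read-only inside kernels

// Reverses each block-sized tile of the input and multiplies it by `scale`.
__global__ void reverse_scale(const float *in, float *out) {
    __shared__ float tile[BLOCK];        // one copy per thread block
    int t = threadIdx.x;                 // local variable, normally a register
    int base = blockIdx.x * blockDim.x;
    tile[t] = in[base + t];              // global memory -> shared memory
    __syncthreads();                     // the whole tile must be loaded first
    out[base + t] = scale * tile[blockDim.x - 1 - t];
}

int main() {
    const int n = 4 * BLOCK;
    float h_in[n], h_out[n];
    for (int i = 0; i < n; ++i) h_in[i] = (float)i;

    float h_scale = 2.0f;
    cudaMemcpyToSymbol(scale, &h_scale, sizeof(float));

    float *d_in, *d_out;                 // global (device) memory
    cudaMalloc(&d_in, n * sizeof(float));
    cudaMalloc(&d_out, n * sizeof(float));
    cudaMemcpy(d_in, h_in, n * sizeof(float), cudaMemcpyHostToDevice);

    reverse_scale<<<n / BLOCK, BLOCK>>>(d_in, d_out);
    cudaMemcpy(h_out, d_out, n * sizeof(float), cudaMemcpyDeviceToHost);

    int mismatches = 0;
    for (int b = 0; b < n / BLOCK; ++b)
        for (int t = 0; t < BLOCK; ++t)
            if (h_out[b * BLOCK + t] != 2.0f * h_in[b * BLOCK + BLOCK - 1 - t])
                ++mismatches;
    printf("mismatches: %d\n", mismatches);

    cudaFree(d_in); cudaFree(d_out);
    return mismatches != 0;
}
```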

// control flow, atomics

https://people.maths.ox.ac.uk/~gilesm/cuda/2019/lecture_03.pdf
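
A sketch combining both topics, assuming a simple counting task that is not from the lecture: threads branch on their data, and those that take the branch update one shared counter with `atomicAdd` so that no increments are lost. The name `count_even` is illustrative; error checking is omitted.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Counts the even values in data[0..n). Threads in one warp may diverge on
// the condition; atomicAdd serializes the conflicting updates to *count.
__global__ void count_even(const int *data, int n, int *count) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n && data[i] % 2 == 0) {
        atomicAdd(count, 1);
    }
}

int main() {
    const int n = 1000;
    int *data, *count;
    // Assumption: managed memory to keep the host code short.
    cudaMallocManaged(&data, n * sizeof(int));
    cudaMallocManaged(&count, sizeof(int));
    for (int i = 0; i < n; ++i) data[i] = i;
    *count = 0;

    count_even<<<(n + 255) / 256, 256>>>(data, n, count);
    cudaDeviceSynchronize();

    printf("even count: %d\n", *count);
    int ok = (*count == n / 2);
    cudaFree(data); cudaFree(count);
    return ok ? 0 : 1;
}
```

With `data[i] = i` and `n = 1000`, the even values are 0, 2, ..., 998, so the counter should end at 500. Replacing `atomicAdd(count, 1)` with `(*count)++` would create a data race and typically an undercount.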

// Warp-based programming model -> too complex right after lecture 2; computational tasks are needed first

https://people.maths.ox.ac.uk/~gilesm/cuda/2019/lecture_04.pdf
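
For orientation only, a sketch of one warp-level technique: summing the values held by the 32 lanes of a warp with `__shfl_down_sync`, without shared memory. It assumes a single block of exactly 32 threads; the kernel name `warp_sum` is hypothetical and error checking is omitted.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Tree reduction inside one warp: at each step a lane adds the value held by
// the lane `offset` positions above it. After the loop, lane 0 has the total.
__global__ void warp_sum(const int *in, int *out) {
    int v = in[threadIdx.x];
    for (int offset = 16; offset > 0; offset /= 2) {
        v += __shfl_down_sync(0xffffffffu, v, offset);  // all 32 lanes take part
    }
    if (threadIdx.x == 0) *out = v;
}

int main() {
    int *in, *out;
    // Assumption: managed memory to keep the host code short.
    cudaMallocManaged(&in, 32 * sizeof(int));
    cudaMallocManaged(&out, sizeof(int));
    for (int i = 0; i < 32; ++i) in[i] = i + 1;  // 1, 2, ..., 32

    warp_sum<<<1, 32>>>(in, out);  // one block containing exactly one warp
    cudaDeviceSynchronize();

    printf("warp sum: %d\n", *out);
    int ok = (*out == 528);
    cudaFree(in); cudaFree(out);
    return ok ? 0 : 1;
}
```

The inputs 1 through 32 sum to 528, which makes the result easy to check by hand.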

// libraries, remove this topic and provide computational tasks

https://people.maths.ox.ac.uk/~gilesm/cuda/2019/lecture_05.pdf

// streams and host related code

https://people.maths.ox.ac.uk/~gilesm/cuda/2019/lecture_06.pdf
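
A sketch of the usual stream pattern, under the assumption of two streams that each process half of an array: asynchronous copy in, kernel, asynchronous copy out. The host buffer is pinned with `cudaMallocHost`, which `cudaMemcpyAsync` needs in order to overlap with other work. The kernel `scale2` and the sizes are illustrative; error checking is omitted.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Doubles every element of x[0..n).
__global__ void scale2(float *x, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] *= 2.0f;
}

int main() {
    const int n = 1 << 20;
    const int nstreams = 2;
    const int chunk = n / nstreams;
    const size_t chunk_bytes = chunk * sizeof(float);

    float *h, *d;
    cudaMallocHost(&h, n * sizeof(float));  // pinned (page-locked) host memory
    cudaMalloc(&d, n * sizeof(float));
    for (int i = 0; i < n; ++i) h[i] = 1.0f;

    cudaStream_t s[nstreams];
    for (int k = 0; k < nstreams; ++k) cudaStreamCreate(&s[k]);

    // Operations in one stream run in order; different streams may overlap.
    for (int k = 0; k < nstreams; ++k) {
        int off = k * chunk;
        cudaMemcpyAsync(d + off, h + off, chunk_bytes, cudaMemcpyHostToDevice, s[k]);
        scale2<<<(chunk + 255) / 256, 256, 0, s[k]>>>(d + off, chunk);
        cudaMemcpyAsync(h + off, d + off, chunk_bytes, cudaMemcpyDeviceToHost, s[k]);
    }

    // The host must wait for each stream before reading the results.
    for (int k = 0; k < nstreams; ++k) {
        cudaStreamSynchronize(s[k]);
        cudaStreamDestroy(s[k]);
    }

    int errors = 0;
    for (int i = 0; i < n; ++i) if (h[i] != 2.0f) ++errors;
    printf("errors: %d\n", errors);

    cudaFreeHost(h); cudaFree(d);
    return errors != 0;
}
```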

// Useful links https://www.mat.unimi.it/users/sansotte/cuda/CUDA_by_Example.pdf

// Nvidia lectures https://developer.nvidia.com/educators/existing-courses#2

// cuda-gdb commands: set cuda break_on_launch application; cuda device sm warp lane block thread; step

Exam notes

kernels and launch
warp and operations
shared memory
pageable/pinned memory
atomic operations and global memory
mapped memory
memory transfers, sync/async launch
streams and stream synchronization
CUDA graphs and graph recording (stream capture)
texture memory and binding