๐ฆ ๐งฐ gpu toolkit Collection of basic GPU algorithms implemented in CUDA C++. ๐ช Algorithms ๐ Array ๐ผ๏ธ Image Processing ๐ Computer Graphics ๐ Mathematical ๐ฆพ Linear Algebra ๐งฎ Number Theory ๐ Sorting ๐งฌ String ๐จโ๐ซ Learning Resources ๐ฝ๏ธ CUDA Streams ๐บ CUDA Tutorials by Creel ๐จโ๐ป CUDA Samples by NVIDIA ๐จโ๐ป CUDA C++ Programming Guide ๐บ Intro to CUDA by Josh Holloway ๐จโ๐ป CUDA Training Series by NVIDIA ๐จโ๐ป CUDA Examples by Sylvain Jubertie ๐ฝ๏ธ CUDA C/C++ Basics Slides by NVIDIA ๐ฝ๏ธ GPU Memory Slides by Robert Dalrymple ๐บ CUDA Crash Course by CoffeeBeforeArch ๐ญ CUDA Developer Blog Posts by Mark Harris ๐ฝ๏ธ Parallel Prefix Sum with CUDA by Mark Harris ๐ญ CUDA Basics Blog Posts by Sebastian Eberhardt ๐บ Parallel and Distributed Computing by Tom Nurkkala ๐จโ๐ป GPU Programming: When, Why and How? by ENCCS ๐ฝ๏ธ CUDA: Threaded Programming Slides by Kevin Cooper ๐บ Parallel Programming Concepts Course by Peter Trรถger ๐ฝ๏ธ Advanced CUDA: Memory Optimization Slides by NVIDIA ๐ฝ๏ธ Optimizing Parallel Reduction in CUDA Slides by Mark Harris ๐ฝ๏ธ Advanced CUDA: Performance Optimization Slides by NVIDIA ๐ฝ๏ธ Better Performance at Lower Occupancy Slides by Vasily Volkov ๐ฝ๏ธ GPU Performance Analysis and Optimization by Paulius Micikevicius ๐ฝ๏ธ General-purpose GPU Computing with CUDA Course by Will Landau ๐จโ๐ป GPU Programming and Architecture from University of Pennsylvania ๐ฝ๏ธ CUTLASS: CUDA Template Library Slides by Andrew Kerr and others ๐จโ๐ป Learn CUDA Programming by Jaegeun Han and Bharatkumar Sharma ๐ญ Accelerated Ray Tracing in One Weekend in CUDA Blog post by Roger Allen ๐จโ๐ซ CUDA and Applications to Task-based Programming Eurographics 2021 Tutorial ๐บ Introduction to GPU Programming with CUDA and Thrust Talk by Richard Thomson ๐ญ CUDA Pro Tip: Optimized Filtering with Warp-Aggregated Atomics by Andy Adinets