5 problem sets of parallel programming on CPU and GPU. University projects for High Performance Computing Systems (Fall 2016).
Programmed with CUDA and C on GPU and CPU. Explored both homogeneous and heterogeneous parallel architects and introduced to performance measurement techniques, profiling, experimental evaluation of software interaction with underlying hardware and optimization.
- Operator Sobel
- PSNR
- Code optimization: Loop interchange, loop unrolling, loop fusion, function inling, loop invariant code motion, common subexpression elimination, strength reduction
- Performance analysis and profiling
- OpenMP
- Multi-threaded CPU programming
- K-means clustering
- Performance analysis and profiling
- Convolution filter
- Tiling
- Divergences
- PTXAS
- Convolution
- Histogram
- Mapping
- Contrast enhancement
- Histogram equalization
- CUDA
- C
- Make (sudo apt install make)
- GCC (sudo apt install gcc)