cuda-performance-evaluation

Computer Architecture (CSD208) Project

The project aims to find performance improvements of commonly used algorithms on graphics processing units (GPUs) using General Purpose Programming on GPUs (GPGPUs). We have used Nvidia’s CUDA library for programming. Having ran algorithms of various time complexities and parallelizability, we were able to observe interesting behaviours of these brilliant devices.

We ran our benchmarks on two machines, a Lenovo Gaming Laptop with an Nvidia GTX 1050, and a workstation-class Nvidia K80 cloud-hosted on an Amazon Web Services (AWS) virtualized server. We had initially planned to run on the GPU cluster on our university’s high performance computer, Magus, however, due to unavailability of the GPU cluster, we had to abandon this plan.

To obtain fast performance without the overhead of garbage collection and dynamic memory allocation, we wrote all of our benchmarks using C++14. All the code was compiled using Nvidia’s CUDA Compiler (NVCC) which is based on the popular open-source optimizing Clang LLVM compiler.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
T4		T4
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
T4.sln		T4.sln
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cuda-performance-evaluation

Computer Architecture (CSD208) Project

References

About

Releases

Packages

Contributors 2

Languages

Dev-eloperr/cuda-performance-evaluation

Folders and files

Latest commit

History

Repository files navigation

cuda-performance-evaluation

Computer Architecture (CSD208) Project

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages