Skip to content

High performance GPU engine for matrix multiplication, a core operation in AI/ML. Implemented in c++ / CUDA, optimized memory access and computation for maximum throughput, achieving significant speedups over CPU. Demonstrates expertise in GPU programming, parallel computing, and high performance computing.

Notifications You must be signed in to change notification settings

Kugman/CudaMatrixEngine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

About

High performance GPU engine for matrix multiplication, a core operation in AI/ML. Implemented in c++ / CUDA, optimized memory access and computation for maximum throughput, achieving significant speedups over CPU. Demonstrates expertise in GPU programming, parallel computing, and high performance computing.

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages