Kugman/CudaMatrixEngine
High-performance GPU engine for matrix multiplication, a core operation in AI/ML. Implemented in C++/CUDA, with memory access and computation optimized for throughput, achieving significant speedups over a CPU implementation. Demonstrates expertise in GPU programming, parallel computing, and high-performance computing.
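The repository's kernels are not shown on this page, so the following is only a minimal sketch of the kind of memory-access optimization the description refers to, not the project's actual code. It uses plain C++ to illustrate tiling (blocking): the multiplication is split into small square tiles so that each tile of the inputs is reused while it is still close to the processor, which is the same idea a CUDA kernel applies when it stages tiles in shared memory. The function names `matmul_naive` and `matmul_tiled`, the tile width `kTile`, and the row-major square-matrix layout are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Assumed tile width; chosen to mirror a 16x16 CUDA thread block.
// Hypothetical value, not taken from the repository.
constexpr std::size_t kTile = 16;

// Naive reference: C = A * B for row-major n x n matrices.
std::vector<float> matmul_naive(const std::vector<float>& a,
                                const std::vector<float>& b,
                                std::size_t n) {
    std::vector<float> c(n * n, 0.0f);
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            float acc = 0.0f;
            for (std::size_t k = 0; k < n; ++k) {
                acc += a[i * n + k] * b[k * n + j];
            }
            c[i * n + j] = acc;
        }
    }
    return c;
}

// Tiled variant: iterates over kTile x kTile blocks so each block of A and B
// is reused several times before moving on. Handles n not divisible by kTile.
std::vector<float> matmul_tiled(const std::vector<float>& a,
                                const std::vector<float>& b,
                                std::size_t n) {
    std::vector<float> c(n * n, 0.0f);
    for (std::size_t ii = 0; ii < n; ii += kTile) {
        const std::size_t i_end = std::min(ii + kTile, n);
        for (std::size_t kk = 0; kk < n; kk += kTile) {
            const std::size_t k_end = std::min(kk + kTile, n);
            for (std::size_t jj = 0; jj < n; jj += kTile) {
                const std::size_t j_end = std::min(jj + kTile, n);
                // Multiply the (ii, kk) tile of A by the (kk, jj) tile of B
                // and accumulate into the (ii, jj) tile of C.
                for (std::size_t i = ii; i < i_end; ++i) {
                    for (std::size_t k = kk; k < k_end; ++k) {
                        const float a_ik = a[i * n + k];
                        for (std::size_t j = jj; j < j_end; ++j) {
                            c[i * n + j] += a_ik * b[k * n + j];
                        }
                    }
                }
            }
        }
    }
    return c;
}
```

Calling `matmul_tiled(a, b, n)` with two row-major `n * n` vectors returns the product as a new vector; comparing it against `matmul_naive` on the same inputs is a simple correctness check. In a CUDA version each tile would typically be loaded into shared memory by a thread block, but the loop structure above is the part that carries over.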