Kugman/CudaMatrixEngine

A high-performance GPU engine for matrix multiplication, a core operation in AI/ML. It is implemented in C++/CUDA, with memory access and computation optimized for maximum throughput, and achieves significant speedups over a CPU implementation. The project demonstrates GPU programming, parallel computing, and high-performance computing techniques.
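To make "optimized memory access" concrete, here is a minimal sketch of one common technique for GPU matrix multiplication: shared-memory tiling. It is not taken from this repository. The kernel name `matmul_tiled`, the host wrapper `matmul_gpu`, the tile width `TILE = 16`, and the row-major layout are all assumptions chosen for illustration, and the engine's actual kernels may be organized differently.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical tile width (assumption, not from the repository).
constexpr int TILE = 16;

// Computes C (M x N) = A (M x K) * B (K x N); all matrices are row-major.
// Each thread block stages one TILE x TILE tile of A and of B in shared
// memory, so each global-memory element is reused TILE times per block.
__global__ void matmul_tiled(const float* A, const float* B, float* C,
                             int M, int K, int N) {
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;  // row of C owned by this thread
    int col = blockIdx.x * TILE + threadIdx.x;  // column of C owned by this thread
    float acc = 0.0f;

    for (int t = 0; t < (K + TILE - 1) / TILE; ++t) {
        int a_col = t * TILE + threadIdx.x;
        int b_row = t * TILE + threadIdx.y;
        // Out-of-range elements are loaded as zero so edge tiles stay correct.
        As[threadIdx.y][threadIdx.x] =
            (row < M && a_col < K) ? A[row * K + a_col] : 0.0f;
        Bs[threadIdx.y][threadIdx.x] =
            (b_row < K && col < N) ? B[b_row * N + col] : 0.0f;
        __syncthreads();  // tile fully loaded before anyone reads it

        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();  // everyone done reading before the next load
    }

    if (row < M && col < N) C[row * N + col] = acc;
}

// Hypothetical host wrapper: copies inputs to the device, launches the
// kernel, and copies the result back. Error checking is omitted for brevity.
std::vector<float> matmul_gpu(const std::vector<float>& A,
                              const std::vector<float>& B,
                              int M, int K, int N) {
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    cudaMalloc(&dA, sizeof(float) * M * K);
    cudaMalloc(&dB, sizeof(float) * K * N);
    cudaMalloc(&dC, sizeof(float) * M * N);
    cudaMemcpy(dA, A.data(), sizeof(float) * M * K, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, B.data(), sizeof(float) * K * N, cudaMemcpyHostToDevice);

    dim3 block(TILE, TILE);
    dim3 grid((N + TILE - 1) / TILE, (M + TILE - 1) / TILE);
    matmul_tiled<<<grid, block>>>(dA, dB, dC, M, K, N);

    std::vector<float> C(static_cast<size_t>(M) * N);
    // cudaMemcpy on the default stream waits for the kernel to finish.
    cudaMemcpy(C.data(), dC, sizeof(float) * M * N, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return C;
}
```

Under these assumptions, a call such as `matmul_gpu(A, B, M, K, N)` takes row-major host vectors and returns the product as a row-major vector of size `M * N`. Staging tiles in shared memory is one way to reduce redundant global-memory reads; production kernels typically go further with register blocking or vendor libraries such as cuBLAS.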