# zCUDA Documentation

## Documentation Index

### API Reference

- API Overview & Cross-Reference: complete module listing with function signatures and CUDA mapping

### Modules

| Module | Doc | Description |
|--------|-----|-------------|
| driver | README | Device management, memory, kernel launch, streams, events |
| nvrtc | README | Runtime compilation of CUDA C++ to PTX / CUBIN |
| cublas | README | BLAS Level 1/2/3 (SAXPY, SGEMM, DGEMM, batched, mixed-precision) |
| cublaslt | README | Lightweight GEMM with algorithm heuristics |
| curand | README | GPU random number generation |
| cudnn | README | Convolution, activation, pooling, softmax, batch norm |
| cusolver | README | LU, QR, SVD, Cholesky, eigenvalue decomposition |
| cusparse | README | SpMV, SpMM, SpGEMM with CSR/COO formats |
| cufft | README | 1D/2D/3D Fast Fourier Transform |
| nvtx | README | Profiling annotations for NVIDIA Nsight |
| kernel | API | Kernel DSL: write CUDA kernels in pure Zig, compiled to PTX |

### Guides

- Kernel DSL API Reference: intrinsics, shared memory, WMMA/MMA, TMA, cluster, tcgen05
- CUDA C++ → Zig Migration: port existing CUDA C++ kernels to pure Zig
- Examples Guide: 162 examples (58 host in 10 categories with per-category READMEs, 80 kernel in 11 categories, 24 integration)
- Project README: quick start, build options, and project overview