Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 2 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,26 +37,21 @@ Below is a sample of the GPU specifications data included in the repository:
Attribute (Unit) | H100 | L40S | A100 PCIe 80GB
--- | --- | --- | ---
FP64 (TFLOPS) | 25.6 | 1.4 | 9.7
FP64 Tensor Core (TFLOPS) | 51 | ? | 19.5
FP64 Tensor Core (TFLOPS) | 51 | 1.4 | 19.5
FP32 (TFLOPS) | 51.2 | 91.6 | 19.5
TF32 Tensor Core (TFLOPS) | ? | 183 | 156
TF32 Tensor Core with Sparsity (TFLOPS) | 756 | 366 | 312
FP16 (TFLOPS) | 204.9 | 91.6 | 78
FP16 Tensor Core (TFLOPS) | ? | 362 | 312
FP16 Tensor Core with Sparsity (TFLOPS) | ? | 733 | 624
BF16 (TFLOPS) | ? | ? | ?
BF16 Tensor Core (TFLOPS) | ? | 362 | 312
BF16 Tensor Core with Sparsity (TFLOPS) | 1513 | 733 | 624
FP8 (TFLOPS) | N/A | N/A | N/A
FP8 Tensor Core (TFLOPS) | ? | 733 | N/A
FP8 Tensor Core with Sparsity (TFLOPS) | 3026 | 1466 | N/A
FP4 (TFLOPS) | N/A | N/A | N/A
FP4 Tensor Core (TFLOPS) | N/A | N/A | N/A
FP4 Tensor Core with Sparsity (TFLOPS) | N/A | N/A | N/A
INT8 (TOPS) | ? | ? | ?
INT8 Tensor Core (TOPS) | ? | 733 | 624
INT8 Tensor Core with Sparsity (TOPS) | 3026 | 1466 | 1248
INT4 (TOPS) | N/A | N/A | N/A
INT4 Tensor Core (TOPS) | ? | 733 | ?
INT4 Tensor Core with Sparsity (TOPS) | ? | 1466 | ?
**Architecture Details** | | | |
Expand All @@ -68,6 +63,7 @@ NVIDIA RT Cores | ? | 142 (3rd gen) | ?
NVIDIA Tensor Cores | 456 (4th gen) | 568 (4th gen) | 432 (3rd gen)
NVIDIA CUDA Cores | 14592 | 18176 | 6912
GPU Memory (GB) | 80 | 48 | 80
Memory Type | HBM2e | GDDR6 | HBM2e
Memory Bandwidth (GB/s) | 2048 | 864 | 1935
Interconnect Type | PCIe Gen5 | PCIe Gen4 | PCIe Gen4
Encoders and Decoders | 0, 7 | 3, 3 | 0, 5
Expand Down
Loading