NVIDIA Corporation

All

587 repositories

TensorRT-LLM
Public
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
cuda pytorch moe blackwell llm-serving
C++
•
Apache License 2.0
•1.7k•11k•719•354•Updated Aug 8, 2025Aug 8, 2025
cuda-quantum
Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
python cpp quantum quantum-computing hacktoberfest quantum-programming-language quantum-algorithms quantum-machine-learning unitaryhack
C++
•
Other
•268•766•371•79•Updated Aug 8, 2025Aug 8, 2025
NeMo
Public
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models
Python
•
Apache License 2.0
•3k•15k•67•66•Updated Aug 8, 2025Aug 8, 2025
Megatron-LM
Public
Ongoing research training transformer models at scale
transformers model-para large-language-models
Python
•
Other
•3k•13k•267•113•Updated Aug 8, 2025Aug 8, 2025
NVFlare
Public
NVIDIA Federated Learning Application Runtime Environment
python decentralized pet privacy-protection federated-learning federated-analytics federated-computing
Python
•
Apache License 2.0
•208•772•12•10•Updated Aug 8, 2025Aug 8, 2025
spark-rapids
Public
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
big-data spark gpu rapids
Scala
•
Apache License 2.0
•254•918•1.6k•33•Updated Aug 8, 2025Aug 8, 2025
NeMo-Agent-Toolkit
Public
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
Python
•
Apache License 2.0
•313•1.2k•62•19•Updated Aug 8, 2025Aug 8, 2025
numba-cuda
Public
The CUDA target for Numba
Python
•
BSD 2-Clause "Simplified" License
•34•164•94•27•Updated Aug 8, 2025Aug 8, 2025
Fuser
Public
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++
•
Other
•64•346•184•158•Updated Aug 8, 2025Aug 8, 2025
topograph
Public
A toolkit for discovering cluster network topology.
Go
•
Apache License 2.0
•6•63•1•1•Updated Aug 8, 2025Aug 8, 2025
spark-rapids-benchmarks
Public
Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
Python
•
Apache License 2.0
•34•41•25•3•Updated Aug 8, 2025Aug 8, 2025
TensorRT-Incubator
Public
Experimental projects related to TensorRT
MLIR
•17•109•39•13•Updated Aug 8, 2025Aug 8, 2025
cuopt
Public
NVIDIA cuOpt is an open-source GPU-accelerated optimization engine delivering near real-time solutions for complex decision-making challenges.
Cuda
•
Apache License 2.0
•55•350•64•10•Updated Aug 8, 2025Aug 8, 2025
cccl
Public
CUDA Core Compute Libraries
cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing
C++
•
Other
•251•1.8k•1k•158•Updated Aug 8, 2025Aug 8, 2025
spark-rapids-jni
Public
RAPIDS Accelerator JNI For Apache Spark
Cuda
•
Apache License 2.0
•72•49•76•11•Updated Aug 8, 2025Aug 8, 2025
NeMo-Skills
Public
A project to improve skills of large language models
Python
•
Apache License 2.0
•90•506•36•7•Updated Aug 8, 2025Aug 8, 2025
cloud-native-docs
Public
Documentation repository for NVIDIA Cloud Native Technologies
kubernetes containers kubernetes-operator
PowerShell
•
Apache License 2.0
•28•26•4•4•Updated Aug 8, 2025Aug 8, 2025
holodeck
Public
Holodeck is a project to create test environments optimised for GPU projects.
Go
•
Apache License 2.0
•7•18•1•8•Updated Aug 8, 2025Aug 8, 2025
nv-ingest
Public
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
Python
•
Apache License 2.0
•258•2.7k•88•18•Updated Aug 8, 2025Aug 8, 2025
warp
Public
A Python framework for accelerated simulation, data generation and spatial computing.
python gpu cuda nvidia gpu-acceleration differentiable-programming nvidia-warp
Python
•
Apache License 2.0
•341•5.4k•234•10•Updated Aug 8, 2025Aug 8, 2025
MatX
Public
An efficient C++17 GPU numerical computing library with Python-like syntax
hpc gpu cuda gpgpu gpu-computing
C++
•
BSD 3-Clause "New" or "Revised" License
•103•1.3k•40•8•Updated Aug 8, 2025Aug 8, 2025
spark-rapids-ml
Public
Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
Jupyter Notebook
•
Apache License 2.0
•32•83•33•3•Updated Aug 8, 2025Aug 8, 2025
cudaqx
Public
Accelerated libraries for quantum-classical computing built on CUDA-Q.
C++
•
Other
•24•53•20•8•Updated Aug 7, 2025Aug 7, 2025
NeMo-Agent-Toolkit-UI
Public
The NVIDIA AIQToolkit UI streamlines interacting with AIQToolkit workflows in an easy-to-use web application.
TypeScript
•
Other
•23•38•4•7•Updated Aug 7, 2025Aug 7, 2025
skyhook
Public
A Kubernetes Operator to manage Node OS customizations.
Go
•
Apache License 2.0
•3•24•0•1•Updated Aug 7, 2025Aug 7, 2025
physicsnemo-cfd
Public
Library for using the models trained in PhysicsNeMo in Engineering and CFD workflows
Jupyter Notebook
•
Apache License 2.0
•4•18•0•1•Updated Aug 7, 2025Aug 7, 2025
cloudai
Public
CloudAI Benchmark Framework
Python
•
Apache License 2.0
•32•68•0•13•Updated Aug 7, 2025Aug 7, 2025
nvrc
Public
The NVRC project provides a Rust binary that implements a simple init system for microVMs.
Rust
•
Apache License 2.0
•5•12•3•2•Updated Aug 7, 2025Aug 7, 2025
cuCollections
Public
datastructures cpp gpu cuda hashmap cpp17 hashset hashtable
C++
•
Apache License 2.0
•97•564•56•21•Updated Aug 7, 2025Aug 7, 2025
vgpu-device-manager
Public
NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
Go
•
Apache License 2.0
•22•138•0•7•Updated Aug 7, 2025Aug 7, 2025