749
Public repositories
272,215
Total stars
26,609
Followers
NVIDIA Corporation maintains a significant public presence on GitHub, hosting a wide range of repositories primarily focused on machine learning, deep learning, and GPU computing. Their notable projects include TensorRT, Megatron-LM, and nvidia-docker, with primary programming languages such as Python, C++, and Jupyter Notebook used across various initiatives.
Run agents like Hermes and OpenClaw more securely inside NVIDIA OpenShell with managed inference
Build and run Docker containers leveraging NVIDIA GPUs
NVIDIA Linux open GPU kernel module source
Ongoing research training transformer models at scale
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Style transfer, deep learning, feature transform
NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.
PersonaPlex code.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
the LLM vulnerability scanner
NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.
OpenShell is the safe, private runtime for autonomous AI agents.
Synthesizing and manipulating 2048x1024 images with conditional GANs
A Python framework for GPU-accelerated simulation, robotics, and machine learning.
Transformer related optimization, including BERT, GPT
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Optimized primitives for collective multi-GPU communication
Deep Learning GPU Training System
NVIDIA device plugin for Kubernetes
Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
CUDA Python: Performance meets Productivity
Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
CUDA Core Compute Libraries
AIStore: scalable storage for AI applications
NVIDIA curated collection of educational resources related to general purpose GPU programming.
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
AI agent skills published by NVIDIA
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.
GPU accelerated decision optimization
NVIDIA Federated Learning Application Runtime Environment
Official Codebase for "DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos" (ICML 2026)
cuDNN Frontend is NVIDIA's modern, open-source entry point to the cuDNN library and a growing collection of high-performance open-source kernels.
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
DreamGen: Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
CUDA checkpoint and restore utility
High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI
SOMA BVH to humanoid robot motion retargeting library built with Newton and NVIDIA Warp
JAX-Toolbox
An SDK (Software Development Kit) for building commercial-grade, AI-native, 3GPP, and O-RAN compliant 5G/6G gNB software on NVIDIA-accelerated computing platforms.
Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes
NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
high-performance inference and serving library for interactive autoregressive video and world models
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
NVIDIA OptiX based implementation of ANARI
MIG Partition Editor for NVIDIA GPUs
The unified framework for sim & real robot teleoperation
Running object detection on a webcam feed using TensorRT on NVIDIA GPUs in Python.
Our inference and training framework to run on the Cosmos Models
A benchmark of real-world DL kernel problems
NVAPI is NVIDIA's core software development kit that allows direct access to NVIDIA GPUs and drivers on supported platforms.
NVIDIA Infra Controller - Hardware Lifecycle Management and multitenant networking
The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
Data representations, APIs, and tools for high quality AV and robotics applications
Platform for deploying and routing GPU-accelerated inference, streaming, and batch workloads at scale.
Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
Kubernetes Operator, Helm Charts, Ansible Playbooks, and utility scripts for large-scale AIStore deployments on Kubernetes.
A toolkit for discovering cluster network topology.
Accelerated libraries for quantum-classical computing built on CUDA-Q.
Ubuntu kernels which are optimized for NVIDIA server systems
CloudAI Benchmark Framework
ALCHEMI Toolkit is a developer toolkit for accelerating training and inference for AI in chemistry and material science.
An Optimizer for Nvidia Compilers.
Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training
Unified high-performance Python client for object and file stores.
Accelerated Computer Vision Lab (ACCV-Lab) is a systematic collection of packages with the common goal to facilitate end-to-end efficient training in the ADAS domain, each package offering tools & best practices for a specific aspect/task in this domain.
Multi-language agent runtime for execution scope management, lifecycle events, and middleware on tool and LLM calls.
RAPIDS Accelerator JNI For Apache Spark
A Kubernetes Operator to manage Node OS customizations.
The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.
GPU Memory Reservation Library
Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
Linux driver for diagnostic software
SimReady Foundation is a central repository for defining simulation content specifications based on various runtime use cases.
repo for Numba-CUDA-MLIR
Harmonizer is an online generative enhancement framework that transforms renderings from imperfect scenes into temporally consistent outputs while improving their realism.
Cosmos Evaluator is an automated evaluation & grading system for synthetic video output generated by Cosmos models
K8s-test-infra
NVIDIA GPU Accelerated Application Samples in Google Cloud Platform
NVIDIA NMOS (Networked Media Open Specifications) Library
InstantNuRec: Feed-Forward 3D Gaussian Reconstruction from Driving Logs
DGX RHEL SELinux Policies
Validation and management tools for NVIDIA ISV Lab environments.
The Design System and UI Agent Harness for AI/ML Factories, Robotics, and Autonomous Vehicles
DAQIRI connects high bandwidth streaming sensor data to the NVIDIA software ecosystem
A Rust Crate for interacting with DTMF Redfish endpoints
NVIDIA Switch Infrastructure - Config Manager
Packages for the Skyhook Kubernetes Operator.
Media Function Operator Kit
NVIDIA builds various projects on GitHub, focusing on machine learning and deep learning frameworks. Notable repositories include TensorRT for deep learning inference and Megatron-LM for training large transformer models.
NVIDIA primarily uses Python, C++, Go, C, Jupyter Notebook, and Rust in their GitHub repositories. These languages support their efforts in high-performance computing and AI development.
Yes, NVIDIA's repositories on GitHub are public. This allows developers and researchers to access their extensive collection of tools and resources related to GPU computing and AI.
Monitor NVIDIA Corporation with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account