Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 629 104

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 391 62

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.5k 1.5k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.7k 231

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.9k 455

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.7k 943

Repositories

Showing 10 of 645 repositories
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 12,470 1,976 519 485 Updated Dec 25, 2025
  • edk2 Public

    NVIDIA fork of tianocore/edk2

    NVIDIA/edk2’s past year of commit activity
    C 25 16 0 15 Updated Dec 25, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 14,702 3,409 339 (1 issue needs help) 256 Updated Dec 25, 2025
  • doca-platform Public

    DOCA Platform manages provisioning and service orchestration for Bluefield DPUs

    NVIDIA/doca-platform’s past year of commit activity
    Go 64 Apache-2.0 16 0 0 Updated Dec 25, 2025
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 1,721 Apache-2.0 222 57 57 Updated Dec 25, 2025
  • gpu-operator Public

    NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes

    NVIDIA/gpu-operator’s past year of commit activity
    Go 2,462 Apache-2.0 431 94 67 Updated Dec 25, 2025
  • recsys-examples Public

    Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

    NVIDIA/recsys-examples’s past year of commit activity
    Python 193 39 38 8 Updated Dec 25, 2025
  • TileGym Public

    Helpful kernel tutorials and examples for tile-based GPU programming

    NVIDIA/TileGym’s past year of commit activity
    Python 493 28 0 2 Updated Dec 25, 2025
  • nccl Public

    Optimized primitives for collective multi-GPU communication

    NVIDIA/nccl’s past year of commit activity
    C++ 4,332 1,099 190 73 Updated Dec 25, 2025
  • skyhook Public

    A Kubernetes Operator to manage Node OS customizations.

    NVIDIA/skyhook’s past year of commit activity
    Go 35 Apache-2.0 3 0 0 Updated Dec 25, 2025