NVIDIA Corporation
- 10.3k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- http://www.nvidia.com
Pinned Loading
Repositories
- TensorRT-Model-Optimizer Public
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
NVIDIA/TensorRT-Model-Optimizer’s past year of commit activity - modulus-sym Public
Framework providing pythonic APIs, algorithms and utilities to be used with Modulus core to physics inform model training as well as higher level abstraction for domain experts
NVIDIA/modulus-sym’s past year of commit activity - cuda-quantum Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
NVIDIA/cuda-quantum’s past year of commit activity - TransformerEngine Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
NVIDIA/TransformerEngine’s past year of commit activity