NVIDIA Corporation
- 16.7k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- https://nvidia.com
Pinned Loading
Repositories
- TensorRT-LLM Public
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
NVIDIA/TensorRT-LLM’s past year of commit activity - nv-ingest Public
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
NVIDIA/nv-ingest’s past year of commit activity - KAI-Scheduler Public
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
NVIDIA/KAI-Scheduler’s past year of commit activity - cuda-quantum Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
NVIDIA/cuda-quantum’s past year of commit activity - TransformerEngine Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
NVIDIA/TransformerEngine’s past year of commit activity