NVIDIA/TensorRT

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Builder

NVIDIA • big-tech

Stars

12,858

Using upstream star count

Forks

2,336

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 2, 2019

Project creation date

README Summary

NVIDIA TensorRT is an SDK designed for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT, providing developers with tools to optimize and deploy neural networks for production environments. It focuses on maximizing inference performance through various optimization techniques including layer fusion, precision calibration, and kernel auto-tuning.

AI Dev Skills

Unmapped

Deep Learning Inference Optimization, GPU Computing, Model Quantization, CUDA Programming, Neural Network Acceleration, Performance Optimization, Model Deployment, Low-latency Inference, Batch Processing Optimization, Memory Management

Tags

Deep Learning Inference Optimization, GPU Computing, Model Quantization, CUDA Programming, Neural Network Acceleration, Performance Optimization, Model Deployment, Low-latency Inference, Batch Processing Optimization, Memory Management, On-premise, Production AI, Video, Autonomous Vehicle Inference, On-device AI, Latency-critical Applications, High-throughput Batch Processing, Production Model Deployment, Text, Computer Vision Applications, Cloud API, Edge AI Acceleration, Edge/Mobile, Natural Language Processing Acceleration, Model Serving Optimization, Self-hosted, Audio, Image, Real-time AI Inference, Multimodal, Model Optimization, Edge Computing, C++

Taxonomy

Recent Activity

Updated 1 month ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: high
Maturity: production

Categories

Inference & Serving (Primary), NLP & Text, Coding & Dev Tools, Multimodal AI, Edge & Mobile AI, Other AI / ML, Foundation Models, AI Agents, Generative Media, Computer Vision

PM Skills

Scale & Reliability

Languages

C++ 100.0%

Timeline

Project created: May 2, 2019
Forked: Mar 14, 2026
Your last push: 1 month ago
Upstream last push: 19 days ago
Tracked since: Mar 9, 2026
