Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/TensorRT
Library/TensorRTForked

NVIDIA/TensorRT

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

View on GitHub↗Upstream NVIDIA/TensorRT↗

Builder

NVIDIA

NVIDIA

NVIDIA • big-tech

Stars

13,024

Using upstream star count

Forks

2,369

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 2, 2019

Project creation date

README Summary

[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0) [![Documentation](https://img.shields.io/badge/TensorRT-documentation-brightgreen.svg)](https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html) [![Roadmap](https://img.shields.io/badge/Roadmap-Q1_2026-brightgreen.svg)](documents/tensorrt_roadmap_2026q1.pdf)

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Batch Processing OptimizationCUDA ProgrammingDeep Learning Inference OptimizationGPU ComputingLow-latency InferenceMemory ManagementModel DeploymentModel QuantizationNeural Network AccelerationPerformance Optimization

Tags

Batch Processing OptimizationCUDA ProgrammingDeep Learning Inference OptimizationGPU ComputingLow-latency InferenceMemory ManagementModel DeploymentModel QuantizationNeural Network AccelerationPerformance OptimizationData ScienceDeep LearningDockerForkedGPU / CUDAModel OptimizationNumPyONNXOpen SourcePythonQuantizationRoadmapTensorFlowTensorRT

Taxonomy

AI Trends

On-device AIModel OptimizationProduction AIEdge Computing

category

Inference & ServingFoundation ModelsModel TrainingMLOps & InfrastructureLearning ResourcesData Science & Analytics

Deployment Context

Cloud APISelf-hostedEdge/MobileOn-premise

Modalities

ImageVideoTextAudioMultimodal

Skill Areas

Deep Learning Inference OptimizationGPU ComputingModel QuantizationCUDA ProgrammingNeural Network AccelerationPerformance OptimizationModel DeploymentLow-latency InferenceBatch Processing OptimizationMemory Management

tag

Data ScienceDeep LearningDockerForkedGPU / CUDAModel OptimizationNumPyONNXOpen SourcePythonQuantizationRoadmapTensorFlowTensorRT

Use Cases

Real-time AI InferenceModel Serving OptimizationLatency-critical ApplicationsHigh-throughput Batch ProcessingProduction Model DeploymentEdge AI AccelerationAutonomous Vehicle InferenceComputer Vision ApplicationsNatural Language Processing Acceleration

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

2

Add an additional include path for safety headers (#4716) (#4717)

Kevin Chen • Mar 9, 2026

aa76a58

bugfix: include pyproject.toml in demoDiffusion (#4707)

asfiyab-nvidia • Mar 5, 2026

0da1458

Add UNIFIED_BUILDER code for safety samples (#4704) (#4706)

Kevin Chen • Feb 26, 2026

bdafad3

Quality

production
Quality
high
Maturity
production

Categories

Inference & ServingPrimaryMLOps & InfrastructureLearning ResourcesData Science & AnalyticsFoundation ModelsModel TrainingEdge & Mobile AIOther AI / ML

PM Skills

Cost & EfficiencyScale & ReliabilityData & Evaluation

Languages

C++100.0%

Timeline

Project created
May 2, 2019
Forked
Mar 14, 2026
Your last push
2 months ago
Upstream last push
1 months ago
Tracked since
Mar 9, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…