Library/text-generation-inferenceForked

huggingface/text-generation-inference

text-generation-inference

Large Language Model Text Generation Inference

View on GitHub↗Upstream huggingface/text-generation-inference↗

Builder

HuggingFace

huggingface • ai-lab

Stars

10,873

Using upstream star count

Forks

1,272

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Oct 8, 2022

Project creation date

README Summary

> [!CAUTION] > text-generation-inference is now in maintenance mode. Going forward, we will accept pull requests for minor bug fixes, documentation improvements and lightweight maintenance tasks. > > TGI has initiated the movement for optimized inference engines to rely on a `transformers` model architectures. This approach is now adopted by downstream inference engines, which we contribute to and recommend using going forward: [vllm](https://github.com/vllm-project/vllm), [SGLang](https://githu

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Distributed InferenceGPU AccelerationLarge Language Model DeploymentModel QuantizationModel Serving OptimizationProduction ML SystemsText Generation InferenceTransformer Architecture

Taxonomy

AI Trends

Large Language Models Production AI Systems AI Infrastructure Model Optimization

Recent Activity

Updated 6 months ago

7 Days

30 Days

90 Days

Quality

production

Quality: medium
Maturity: production

PM Skills

Cost & EfficiencyDeveloper PlatformScale & Reliability

Languages

Python100.0%

Timeline

Project created: Oct 8, 2022
Forked: Mar 13, 2026
Your last push: 6 months ago
Upstream last push: 3 months ago
Tracked since: Jan 8, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…

Library/text-generation-inferenceForked

huggingface/text-generation-inference

text-generation-inference

Large Language Model Text Generation Inference

View on GitHub↗Upstream huggingface/text-generation-inference↗

Builder

HuggingFace

huggingface • ai-lab

Stars

10,873

Using upstream star count

Forks

1,272

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Oct 8, 2022

Project creation date

README Summary

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Distributed InferenceGPU AccelerationLarge Language Model DeploymentModel QuantizationModel Serving OptimizationProduction ML SystemsText Generation InferenceTransformer Architecture

Taxonomy

AI Trends

Large Language Models Production AI Systems AI Infrastructure Model Optimization

Recent Activity

Updated 6 months ago

7 Days

30 Days

90 Days

Quality

production

Quality: medium
Maturity: production

PM Skills

Cost & EfficiencyDeveloper PlatformScale & Reliability

Languages

Python100.0%

Timeline

Project created: Oct 8, 2022
Forked: Mar 13, 2026
Your last push: 6 months ago
Upstream last push: 3 months ago
Tracked since: Jan 8, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…

text-generation-inference

README Summary

Community Evaluation

AI Dev Skills

Tags

Taxonomy

Recent Activity

Quality

Categories

PM Skills

Languages

Timeline

Similar Repos

text-generation-inference

README Summary

Community Evaluation

AI Dev Skills

Tags

Taxonomy

Recent Activity

Quality

Categories

PM Skills

Languages

Timeline

Similar Repos