InternLM/lmdeploy
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Builder

InternLM • individual
Stars
7,747
Using upstream star count
Forks
678
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
Jun 15, 2023
Project creation date
README Summary
LMDeploy is a toolkit for compressing, deploying, and serving Large Language Models (LLMs). It covers model optimization and deployment in production environments, with the aim of making LLM serving more accessible and performant.
AI Dev Skills
Unmapped
Large Language Model Deployment, Model Compression and Quantization, High-Performance Inference Serving, Model Optimization, Production ML Systems, GPU Acceleration, Distributed Model Serving, Transformer Model Optimization
Tags
Large Language Model Deployment, Model Compression and Quantization, High-Performance Inference Serving, Model Optimization, Production ML Systems, GPU Acceleration, Distributed Model Serving, Transformer Model Optimization, High-throughput Text Generation, Production AI Systems, Large-scale Language Model Deployment, Efficient AI Inference, Model Compression and Optimization, Model Inference Optimization, Edge AI Deployment, On-premise, Model Quantization, LLM Performance Tuning, Resource-constrained LLM Serving, Cloud API, Model Serving Infrastructure, Self-hosted, Production LLM API Serving, Large Language Models, Enterprise LLM Infrastructure, Edge/Mobile, Text, GPU Memory Management, Inference Optimization, Python
Recent Activity
Updated 23 days ago
- 7 Days: 0
- 30 Days: 0
- 90 Days: 0
Quality
- Quality: high
- Maturity: production
Categories
Dev Tools & Automation (Primary), Inference & Serving, ML Platform & Infrastructure, Edge & Mobile AI, Other AI / ML, Foundation Models
PM Skills
Developer Platform
Languages
Python 100.0%
Timeline
- Project created: Jun 15, 2023
- Forked: Mar 22, 2026
- Your last push: 23 days ago
- Upstream last push: 6 days ago
- Tracked since: Mar 22, 2026