sgl-project/sglang
sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
Builder

sgl-project • individual
Stars
25,363
Using upstream star count
Forks
5,144
Using upstream fork count
Open Issues
0
Activity Score
0/100
1140 commits in 30d
Created
Jan 8, 2024
README Summary
SGLang is a high-performance serving framework designed for large language models and multimodal models. It provides efficient inference and serving capabilities for AI models at scale. The framework focuses on optimizing performance and resource utilization for production deployments.
AI Dev Skills
Unmapped
Large Language Model Serving, Model Inference Optimization, Multimodal Model Deployment, High-Performance Computing, Distributed Systems, GPU Acceleration, Model Serving Architecture, Inference Engine Development
Tags
Large Language Model Serving, Model Inference Optimization, Multimodal Model Deployment, High-Performance Computing, Distributed Systems, GPU Acceleration, Model Serving Architecture, Inference Engine Development, Multimodal AI, Multimodal, Production AI Model Deployment, LLM API Serving, Developer Tools, Multimodal Model Inference, On-premise, Self-hosted, Scalable AI Model Hosting, Model Serving Optimization, AI Infrastructure, Large Language Models, High-Throughput Text Generation, Cloud API, AI Platform Services, Text, Cloud Infrastructure, Python
Taxonomy
Deployment Context
Modalities
Skill Areas
Recent Activity
Updated 1 month ago
7 Days
252
30 Days
1140
90 Days
3433
Quality
- Quality: high
- Maturity: beta
Categories
Dev Tools & Automation (primary), Inference & Serving, ML Platform & Infrastructure, Multimodal AI, Other AI / ML, Foundation Models
PM Skills
Developer Platform
Languages
Python 100.0%
Timeline
- Project created: Jan 8, 2024
- Forked: Mar 13, 2026
- Your last push: 1 month ago
- Upstream last push: 6 days ago
- Tracked since: Mar 13, 2026