Library/DeepSpeed
Library/DeepSpeedForked

deepspeedai/DeepSpeed

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Builder

deepspeedai

deepspeedai

deepspeedai • individual

Stars

41,975

Using upstream star count

Forks

4,772

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jan 23, 2020

Project creation date

README Summary

DeepSpeed is a deep learning optimization library developed by Microsoft that enables efficient distributed training and inference for large-scale models. It provides memory optimization techniques, model parallelism, and gradient compression to make training billion-parameter models accessible on limited hardware. The library integrates with popular frameworks like PyTorch and Transformers to accelerate deep learning workflows.

AI Dev Skills

Unmapped

Distributed Deep LearningModel ParallelismData ParallelismPipeline ParallelismMemory OptimizationGradient CompressionMixed Precision TrainingZero Redundancy OptimizerLarge Language Model TrainingTransformer ArchitectureGPU Memory ManagementDistributed SystemsHigh Performance ComputingNeural Network OptimizationScalable Machine Learning

Tags

Distributed Deep LearningModel ParallelismData ParallelismPipeline ParallelismMemory OptimizationGradient CompressionMixed Precision TrainingZero Redundancy OptimizerLarge Language Model TrainingTransformer ArchitectureGPU Memory ManagementDistributed SystemsHigh Performance ComputingNeural Network OptimizationScalable Machine LearningDistributed AI SystemsCommunication Backend OptimizationZeRO Optimizer StatesLarge Language ModelsMemory-Efficient Model InferenceMulti-GPU ClustersGradient Synchronization OptimizationCloudMultimodalInference AccelerationAccelerated Deep Learning InferenceMemory-Efficient AITextMulti-GPU Model ParallelizationEfficient AI TrainingOn-premiseDistributed Neural Network TrainingPython

Taxonomy

Recent Activity

Updated 22 days ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

MLOps & InfrastructurePrimaryDev Tools & AutomationInference & ServingML Platform & InfrastructureMultimodal AIOther AI / MLFoundation ModelsModel Training

PM Skills

Scale & ReliabilityDeveloper Platform

Languages

Python100.0%

Timeline

Project created
Jan 23, 2020
Forked
Mar 22, 2026
Your last push
22 days ago
Upstream last push
7 days ago
Tracked since
Mar 22, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…