Reporium

hpcaitech/ColossalAI

ColossalAI

Making large AI models cheaper, faster and more accessible

Upstream: hpcaitech/ColossalAI

Builder

hpcaitech • individual

Stars

41,383

Using upstream star count

Forks

4,504

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Oct 28, 2021

Project creation date

README Summary

[![logo](https://raw.githubusercontent.com/hpcaitech/public_assets/main/colossalai/img/colossal-ai_logo_vertical.png)](https://www.colossalai.org/)

AI Dev Skills

Unmapped

Cluster Computing, CUDA Programming, Data Parallelism, Distributed Deep Learning, Gradient Compression, Inference Optimization, Large Language Model Training, Memory Optimization, Mixed Precision Training, Model Fine-tuning, Model Parallelism, Model Sharding, Multi-GPU Training, Pipeline Parallelism, Tensor Parallelism, Transformer Architecture, Zero Redundancy Optimization

Tags

Cluster Computing, CUDA Programming, Data Parallelism, Distributed Deep Learning, Gradient Compression, Inference Optimization, Large Language Model Training, Memory Optimization, Mixed Precision Training, Model Fine-tuning, Model Parallelism, Model Sharding, Multi-GPU Training, Pipeline Parallelism, Tensor Parallelism, Transformer Architecture, Zero Redundancy Optimization, Benchmarking, C++, Curated List, Deep Learning, DeepSeek, Docker, Embeddings, Evals, Forked, GPU / CUDA, HuggingFace, Image Generation, LLM Serving, Large Language Models, Llama, LoRA / PEFT, MMLU, Open Source, OpenAI, PyTorch, Python, Qwen, RLHF, Reinforcement Learning, Research / Papers, Stable Diffusion, Transformers, Tutorial, Video Generation, vLLM

Taxonomy

AI Trends

Large Language Models, Foundation Models, Efficient AI Training, Model Democratization, Distributed AI Systems

Category

Foundation Models, RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Generative Media, MLOps & Infrastructure, Learning Resources

Deployment Context

Cloud, On-premise, Multi-GPU Clusters, Distributed Systems

Modalities

Text, Image, Multimodal

Skill Areas

Distributed Deep Learning, Model Parallelism, Data Parallelism, Pipeline Parallelism, Memory Optimization, Large Language Model Training, Transformer Architecture, Mixed Precision Training, Gradient Compression, Model Sharding, Zero Redundancy Optimization, Tensor Parallelism, Multi-GPU Training, Cluster Computing, CUDA Programming, Model Fine-tuning, Inference Optimization

Use Cases

Large Language Model Pre-training, Distributed Model Fine-tuning, Multi-billion Parameter Model Training, Memory-efficient Model Inference, High-throughput Model Serving, Research Model Experimentation, Production Model Deployment

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: high
Maturity: production

Categories

Foundation Models (Primary), RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Generative Media, Coding & Dev Tools, Search & Knowledge, Other AI / ML, MLOps & Infrastructure, Learning Resources

PM Skills

Cost & Efficiency, Scale & Reliability, Data & Evaluation, Product Discovery

Languages

Python 100.0%

Timeline

Project created: Oct 28, 2021
Forked: Mar 22, 2026
Your last push: 2 months ago
Upstream last push: 23 days ago
Tracked since: Mar 16, 2026

Similar Repos

pgvector cosine similarity · $0
