Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/gpt-neox
Library/gpt-neoxForked

EleutherAI/gpt-neox

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

View on GitHub↗Upstream EleutherAI/gpt-neox↗

Builder

EleutherAI

EleutherAI

EleutherAI • ai-lab

Stars

7,430

Using upstream star count

Forks

1,113

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Dec 22, 2020

Project creation date

README Summary

[![GitHub issues](https://img.shields.io/github/issues/EleutherAI/gpt-neox)](https://github.com/EleutherAI/gpt-neox/issues) [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Weights & Biases monitoring" height=20>](https://wandb.ai/eleutherai/neox)

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Autoregressive Language ModelingDistributed TrainingGPU OptimizationGradient AccumulationLarge Language Model TrainingMemory-Efficient TrainingMixed Precision TrainingModel ParallelismTransformer Architecture

Tags

Autoregressive Language ModelingDistributed TrainingGPU OptimizationGradient AccumulationLarge Language Model TrainingMemory-Efficient TrainingMixed Precision TrainingModel ParallelismTransformer ArchitectureAWSCLI ToolCourseDPODeep LearningDeepSpeedDockerEmbeddingsEvalsFine-TuningForkedGPU / CUDAHuggingFaceKubernetesLM Eval HarnessLarge Language ModelsMachine LearningMistralMultimodal AINode.jsOpen SourceOpenAIPyTorchPythonRLHFReinforcement LearningResearch / PapersTensorFlowTransformersTutorialWeights & Biases

Taxonomy

AI Trends

Large Language ModelsDistributed AI TrainingOpen Source AI ModelsEfficient AI Training

category

Model TrainingFoundation ModelsRAG & RetrievalEvals & BenchmarkingObservability & MonitoringInference & ServingMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning Resources

Deployment Context

Self-hostedOn-premiseCloud GPU Clusters

Modalities

Text

Skill Areas

Transformer ArchitectureModel ParallelismDistributed TrainingLarge Language Model TrainingGPU OptimizationAutoregressive Language ModelingMixed Precision TrainingGradient AccumulationMemory-Efficient Training

tag

AWSCLI ToolCourseDPODeep LearningDeepSpeedDockerEmbeddingsEvalsFine-TuningForkedGPU / CUDAHuggingFaceKubernetesLM Eval HarnessLarge Language ModelsMachine LearningMistralMultimodal AINode.jsOpen SourceOpenAIPyTorchPythonRLHFReinforcement LearningResearch / PapersTensorFlowTransformersTutorialWeights & Biases

Use Cases

Large Language Model TrainingDistributed Transformer TrainingMulti-GPU Model ParallelismResearch-Scale Language Model DevelopmentCustom Language Model Training

Recent Activity

Updated 4 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

RAG & RetrievalPrimaryEvals & BenchmarkingObservability & MonitoringInference & ServingMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning ResourcesFoundation ModelsModel TrainingSafety & AlignmentMultimodal AISearch & KnowledgeOther AI / ML

PM Skills

User ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper Platform

Languages

Python100.0%

Timeline

Project created
Dec 22, 2020
Forked
Mar 22, 2026
Your last push
4 months ago
Upstream last push
1 months ago
Tracked since
Feb 3, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…