Reporium

NVIDIA/apex

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch


Builder

NVIDIA • big-tech

Stars

8,971

Using upstream star count

Forks

1,520

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 23, 2018

Project creation date

README Summary

This repository holds NVIDIA-maintained utilities to streamline mixed precision and distributed training in PyTorch. Some of the code here will eventually be included in upstream PyTorch. The intent of Apex is to make up-to-date utilities available to users as quickly as possible.
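The summary does not say how mixed precision training stays numerically stable. One technique commonly associated with it is loss scaling, and the following is a minimal sketch of that idea using only the Python standard library. It is not Apex's API: the helper `to_fp16`, the constant `LOSS_SCALE`, and the gradient value are hypothetical and chosen purely for illustration.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


# Hypothetical values, chosen for illustration only.
LOSS_SCALE = 2.0 ** 16   # a static loss scale
grad = 1e-8              # a gradient smaller than fp16 can represent

# Stored directly in half precision, the gradient underflows to zero.
unscaled = to_fp16(grad)

# Scaling first moves the value into fp16's representable range.
scaled = to_fp16(grad * LOSS_SCALE)

# Unscaling is done in higher precision (Python floats are 64-bit).
recovered = scaled / LOSS_SCALE

print("fp16 without scaling:", unscaled)
print("recovered after scaling:", recovered)
```

In a real training loop the scale would multiply the loss before the backward pass, and the gradients would be divided by the same factor before the optimizer step.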


AI Dev Skills

Unmapped

Automatic Mixed Precision (AMP), Distributed Deep Learning, GPU Memory Optimization, Mixed Precision Training, Multi-GPU Training, Neural Network Training Acceleration, PyTorch Optimization

Tags

Automatic Mixed Precision (AMP), Distributed Deep Learning, GPU Memory Optimization, Mixed Precision Training, Multi-GPU Training, Neural Network Training Acceleration, PyTorch Optimization, C++, CLI Tool, Forked, GPU / CUDA, PyTorch, Python

Taxonomy

AI Trends

Large Language Models, Efficient AI Training, Scalable Deep Learning

Category

Model Training, Inference & Serving, Dev Tools & Automation

Deployment Context

Cloud, On-premise, Multi-GPU Systems

Modalities

Text, Image, Audio, Video, Multimodal

Skill Areas

Mixed Precision Training, Distributed Deep Learning, GPU Memory Optimization, Neural Network Training Acceleration, Automatic Mixed Precision (AMP), Multi-GPU Training, PyTorch Optimization

Tag

C++, CLI Tool, Forked, GPU / CUDA, PyTorch, Python

Use Cases

Large Language Model Training, Computer Vision Model Training, Deep Neural Network Optimization, Multi-GPU Model Training, Memory-Efficient Training

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

2

Fix lerp overload ambiguity with std::lerp under C++20 (#1985)

Wang, Xiao • Mar 10, 2026

f199212

Fix divide-by-zero in GroupNorm two-pass kernel for large batch sizes (#1984)

Tailing Yuan • Mar 5, 2026

dbe421e

Deprecate apex.contrib.fmha and apex.contrib.multihead_attn (#1932)

Aidyn-A • Mar 4, 2026

212061e

Quality

Quality: high
Maturity: production

Categories

Inference & Serving (Primary), Dev Tools & Automation, Model Training

PM Skills

Developer Platform

Languages

Python 100.0%

Timeline

Project created: Apr 23, 2018
Forked: Mar 14, 2026
Your last push: 2 months ago
Upstream last push: 21 days ago
Tracked since: Mar 10, 2026

Similar Repos

pgvector cosine similarity · $0
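The label above indicates that similar repos are ranked by cosine similarity over stored vectors. As a rough sketch of that ranking in plain Python: the repo names and three-dimensional embeddings below are hypothetical, and a real deployment would compute this inside the database rather than in application code (pgvector's `<=>` operator returns cosine distance, which is one minus cosine similarity).

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical embeddings, for illustration only.
embeddings = {
    "repo-a": [1.0, 0.0, 1.0],
    "repo-b": [0.9, 0.1, 1.1],
    "repo-c": [0.0, 1.0, 0.0],
}

query = embeddings["repo-a"]

# Rank every other repo by similarity to the query, most similar first.
ranked = sorted(
    (name for name in embeddings if name != "repo-a"),
    key=lambda name: cosine_similarity(query, embeddings[name]),
    reverse=True,
)

print(ranked)
```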
