pytorch/ao
PyTorch native quantization and sparsity for training and inference
Builder
pytorch
pytorch • individual
Stars
2,841
Using upstream star count
Forks
510
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
Nov 3, 2023
Project creation date
PyTorch-Native Training-to-Serving Model Optimization
- Pre-train Llama-3.1-70B **1.5x faster** with float8 training
- Recover **67% of quantized accuracy degradation** on Gemma3-4B with QAT
- Quantize Llama-3-8B to int4 for **1.89x faster** inference with **58% less memory**
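To make the int4 claim concrete, here is a minimal pure-Python sketch of symmetric, group-wise 4-bit weight quantization. It is an illustration only, not torchao's API or implementation: the function names `quantize_int4` and `dequantize_int4` and the `group_size` parameter are hypothetical, and real kernels pack two 4-bit codes per byte and run on tensors rather than Python lists.

```python
def quantize_int4(weights, group_size=4):
    """Symmetric per-group quantization to the signed 4-bit range [-8, 7].

    Hypothetical helper for illustration; returns (codes, scales).
    """
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Map the largest magnitude in the group to code 7; avoid a zero scale.
        scale = max_abs / 7 if max_abs > 0 else 1.0
        scales.append(scale)
        for w in group:
            q = round(w / scale)
            codes.append(max(-8, min(7, q)))  # clamp to the int4 range
    return codes, scales


def dequantize_int4(codes, scales, group_size=4):
    """Reconstruct approximate float weights from codes and per-group scales."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]


if __name__ == "__main__":
    weights = [0.7, -0.35, 0.0, 0.1, 1.4, -1.4, 0.2, 0.6]
    codes, scales = quantize_int4(weights)
    print(codes, scales)
    print(dequantize_int4(codes, scales))
```

Each weight is stored as a 4-bit code plus one shared scale per group, so the reconstruction error per weight is at most half of that group's scale. The speed and memory figures quoted above come from the project's own description and are not derived from this sketch.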
Updated 2 months ago
7 Days
0
30 Days
0
90 Days
20