wafer-ai/gpu-perf-engineering-resources

gpu-perf-engineering-resources

A curriculum for learning GPU performance engineering, from scratch to the techniques used by frontier AI labs

Builder

wafer-ai

wafer-ai • individual

Stars

512

Using upstream star count

Forks

50

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jan 12, 2026

Project creation date

README Summary

This repository provides a comprehensive curriculum for learning GPU performance engineering from beginner to advanced levels, covering the techniques used by frontier AI labs. It serves as an educational resource with structured learning materials and references for understanding GPU optimization and performance tuning.

AI Dev Skills

Unmapped

GPU Architecture and Programming, CUDA Programming, Memory Management Optimization, Kernel Optimization, Parallel Computing, Performance Profiling, Hardware-Software Co-design, Deep Learning Systems Optimization, Tensor Operations Optimization, Distributed Computing, High-Performance Computing

Tags

GPU Architecture and Programming, CUDA Programming, Memory Management Optimization, Kernel Optimization, Parallel Computing, Performance Profiling, Hardware-Software Co-design, Deep Learning Systems Optimization, Tensor Operations Optimization, Distributed Computing, High-Performance Computing, Education, High Performance Computing, AI Infrastructure, Deep Learning Infrastructure, GPU Performance Optimization, Performance Engineering Education, Hardware Acceleration Training, GPU Performance Training, Self-hosted, Developer Tools, Hardware Acceleration, Cloud, Performance Engineering, CUDA Development Learning, System Architecture Design, AI Infrastructure Optimization, Hardware Optimization, Memory Management, On-premise

Taxonomy

Recent Activity

Updated 28 days ago

7 Days

0

30 Days

0

90 Days

15

Quality

Quality
medium
Maturity
beta

Categories

Model Training (Primary), Inference & Serving, Other AI / ML, Dev Tools & Automation, ML Platform & Infrastructure

PM Skills

Developer Platform

Languages

No language breakdown recorded.

Timeline

Project created
Jan 12, 2026
Forked
Feb 24, 2026
Your last push
28 days ago
Upstream last push
1 month ago
Tracked since
Mar 17, 2026
