Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/ai-performance-engineering
Library/ai-performance-engineeringForked

cfregly/ai-performance-engineering

ai-performance-engineering

_**Update:** Are you interested in a hands-on course for this material?_

View on GitHub↗Upstream cfregly/ai-performance-engineering↗

Builder

cfregly

cfregly

cfregly • individual

Stars

1,536

Using upstream star count

Forks

216

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 21, 2025

Project creation date

README Summary

_**Update:** Are you interested in a hands-on course for this material?_

Community Evaluation

Loading…

AI Dev Skills

Unmapped

AI Performance OptimizationMachine Learning Systems EngineeringModel Performance Tuning

Tags

AI Performance OptimizationMachine Learning Systems EngineeringModel Performance TuningAI AgentsAWSBackendBenchmarkingC++CachingCourseData ScienceDockerForkedFSDPGPU / CUDAInferenceKubernetesKV CacheLarge Language ModelsLLM ServingModel OptimizationNode.jsOpenAIPlanning / CoTPythonPyTorchQuantizationReal-Time / StreamingReinforcement LearningSGLangTensorRTTransformersvLLM

Taxonomy

AI Trends

AI Performance EngineeringModel Optimization

category

Inference & ServingFoundation ModelsAI AgentsModel TrainingEvals & BenchmarkingMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning ResourcesData Science & Analytics

Skill Areas

AI Performance OptimizationMachine Learning Systems EngineeringModel Performance Tuning

tag

AI AgentsAWSBackendBenchmarkingC++CachingCourseData ScienceDockerFSDPForkedGPU / CUDAInferenceKV CacheKubernetesLLM ServingLarge Language ModelsModel OptimizationNode.jsOpenAIPlanning / CoTPyTorchPythonQuantizationReal-Time / StreamingReinforcement LearningSGLangTensorRTTransformersvLLM

Use Cases

AI Model Performance OptimizationML System Performance Analysis

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

20

added fabric analysis and profiling and benchmarking including spectrum-x and infiniband

Chris Fregly • Mar 16, 2026

7adca4b

chore: checkpoint benchmark refactors

Chris Fregly • Mar 16, 2026

cd01d4c

fix: harden warning suppression and artifact readers

Chris Fregly • Mar 16, 2026

1d8f820

Quality

research
Quality
low
Maturity
research

Categories

Inference & ServingPrimaryFoundation ModelsAI AgentsModel TrainingEvals & BenchmarkingMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning ResourcesData Science & AnalyticsOther AI / ML

PM Skills

AI-Native ArchitectureCost & EfficiencyData & EvaluationScale & Reliability

Languages

Python100.0%

Timeline

Project created
Apr 21, 2025
Forked
Feb 24, 2026
Your last push
2 months ago
Upstream last push
2 months ago
Tracked since
Mar 17, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…