Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/airllm
Library/airllmForked

lyogavin/airllm

airllm

AirLLM 70B inference with single 4GB GPU

View on GitHub↗Upstream lyogavin/airllm↗

Builder

lyogavin

lyogavin

lyogavin • individual

Stars

18,406

Using upstream star count

Forks

2,002

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jun 12, 2023

Project creation date

README Summary

[**Quickstart**](#quickstart) | [**Configurations**](#configurations) | [**MacOS**](#macos) | [**Example notebooks**](#example-python-notebook) | [**FAQ**](#faq)

Community Evaluation

Loading…

AI Dev Skills

Unmapped

GPU Memory ManagementLarge Language Model InferenceLow-Resource AI DeploymentMemory-Efficient ComputingModel OptimizationTransformer Architecture

Tags

GPU Memory ManagementLarge Language Model InferenceLow-Resource AI DeploymentMemory-Efficient ComputingModel OptimizationTransformer ArchitectureBenchmarkingDistillationEvalsForkedGPU / CUDAHuggingFaceJupyterLarge Language ModelsLlamaMistralPyTorchPythonQuantizationQwenResearch / PapersRustTransformers

Taxonomy

AI Trends

On-device AIDemocratized AI AccessResource-Efficient AI

category

Foundation ModelsModel TrainingEvals & BenchmarkingInference & ServingLearning ResourcesData Science & Analytics

Deployment Context

Self-hostedLocal Development

Modalities

Text

Skill Areas

Large Language Model InferenceMemory-Efficient ComputingGPU Memory ManagementModel OptimizationTransformer ArchitectureLow-Resource AI Deployment

tag

BenchmarkingDistillationEvalsForkedGPU / CUDAHuggingFaceJupyterLarge Language ModelsLlamaMistralPyTorchPythonQuantizationQwenResearch / PapersRustTransformers

Use Cases

Local LLM InferenceCost-Effective AI DeploymentResearch on Consumer HardwareEducational AI Experimentation

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

1

Add funding.json for open source funding transparency

Yu Li • Mar 10, 2026

aa2a6f6

Quality

prototype
Quality
medium
Maturity
prototype

Categories

Evals & BenchmarkingPrimaryInference & ServingLearning ResourcesData Science & AnalyticsFoundation ModelsModel TrainingSearch & Knowledge

PM Skills

Cost & EfficiencyData & Evaluation

Languages

Jupyter Notebook100.0%

Timeline

Project created
Jun 12, 2023
Forked
Mar 16, 2026
Your last push
2 months ago
Upstream last push
2 months ago
Tracked since
Mar 10, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…