Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/llmfit
Library/llmfitForked

AlexsJones/llmfit

llmfit

Hundreds of models & providers. One command to find what runs on your hardware.

View on GitHub↗Upstream AlexsJones/llmfit↗

Builder

AlexsJones

AlexsJones

AlexsJones • individual

Stars

26,858

Using upstream star count

Forks

1,632

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Feb 15, 2026

Project creation date

README Summary

<p align="center"> <img src="assets/icon.svg" alt="llmfit icon" width="128" height="128"> </p> <p align="center"> <a href="https://github.com/AlexsJones/llmfit/actions/workflows/ci.yml"><img src="https://github.com/AlexsJones/llmfit/actions/workflows/ci.yml/badge.svg" alt="CI"></a> <a href="https://crates.io/crates/llmfit"><img src="https://img.shields.io/crates/v/llmfit.svg" alt="Crates.io"></a> <a href="LICENSE"><img src="https://img.shields.io/badge/license-MIT-blue.svg" alt="License"

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Cross-Platform Model DeploymentHardware Performance OptimizationLanguage Model BenchmarkingModel Provider IntegrationModel Selection and EvaluationResource-Constrained ML

Tags

Cross-Platform Model DeploymentHardware Performance OptimizationLanguage Model BenchmarkingModel Provider IntegrationModel Selection and EvaluationResource-Constrained MLAI SafetyAPIBenchmarkingC++CLI ToolContext EngineeringDatabaseDeepSeekDockerEmbeddingsEvalsForkedGPU / CUDAGemmaHuggingFaceKV CacheKubernetesLLM ServingLarge Language ModelsLlamaMistralMultimodal AINode.jsOllamaPhiPlanning / CoTPythonQuantizationQwenReal-Time / StreamingUnslothllama.cppvLLM

Taxonomy

AI Trends

On-device AISmall Language ModelsHardware-Efficient AI

category

Foundation ModelsAI AgentsRAG & RetrievalModel TrainingEvals & BenchmarkingInference & ServingMLOps & InfrastructureDev Tools & AutomationSecurity & Safety

Deployment Context

Self-hostedEdge/MobileOn-premise

Industries

Developer Tools

Modalities

Text

Skill Areas

Language Model BenchmarkingHardware Performance OptimizationModel Selection and EvaluationCross-Platform Model DeploymentResource-Constrained MLModel Provider Integration

tag

AI SafetyAPIBenchmarkingC++CLI ToolContext EngineeringDatabaseDeepSeekDockerEmbeddingsEvalsForkedGPU / CUDAGemmaHuggingFaceKV CacheKubernetesLLM ServingLarge Language ModelsLlamaMistralMultimodal AINode.jsOllamaPhiPlanning / CoTPythonQuantizationQwenReal-Time / StreamingUnslothllama.cppvLLM

Use Cases

Hardware Compatibility TestingModel Performance BenchmarkingPre-deployment Model SelectionResource Optimization Planning

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

20

chore: version bump

AlexsJones • Mar 12, 2026

eb5eadd

chore: version bump

AlexsJones • Mar 12, 2026

340b412

Merge pull request #196 from bgupta/fix/awq-gptq-format-support

Alex Jones • Mar 11, 2026

0ba8b96

Quality

prototype
Quality
medium
Maturity
prototype

Categories

RAG & RetrievalPrimaryEvals & BenchmarkingInference & ServingMLOps & InfrastructureDev Tools & AutomationSecurity & SafetyFoundation ModelsAI AgentsModel TrainingSafety & AlignmentMultimodal AIOther AI / ML

PM Skills

Cost & EfficiencySafety & AlignmentUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper PlatformAI-Native Architecture

Languages

Rust100.0%

Timeline

Project created
Feb 15, 2026
Forked
Mar 12, 2026
Your last push
2 months ago
Upstream last push
16 days ago
Tracked since
Mar 12, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…