Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/omlx
Library/omlxForked

jundot/omlx

omlx

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

View on GitHub↗Upstream jundot/omlx↗

Builder

jundot

jundot

jundot • individual

Stars

17,179

Using upstream star count

Forks

1,458

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

—

README Summary

<p align="center"> <picture> <source media="(prefers-color-scheme: dark)" srcset="docs/images/icon-rounded-dark.svg" width="140"> <source media="(prefers-color-scheme: light)" srcset="docs/images/icon-rounded-light.svg" width="140"> <img alt="oMLX" src="docs/images/icon-rounded-light.svg" width="140"> </picture> </p>

Community Evaluation

Loading…

AI Dev Skills

No AI dev skills recorded.

Tags

ActiveAnthropic / ClaudeBatchingBenchmarkingCachingClaudeClaude CodeDeepSeekEmbeddingsEvalsForkedGemmaHuggingFaceKV CacheLarge Language ModelsLLM ServingMCPMistralMLOpsOpenAIPythonPython Web FrameworkQwenReal-Time / StreamingReasoning ModelsRerankingSpeculative DecodingStructured OutputTool UsevLLM

Taxonomy

category

Inference & ServingFoundation ModelsAI AgentsRAG & RetrievalEvals & BenchmarkingMLOps & InfrastructureDev Tools & Automation

tag

ActiveAnthropic / ClaudeBatchingBenchmarkingCachingClaudeClaude CodeDeepSeekEmbeddingsEvalsForkedGemmaHuggingFaceKV CacheLLM ServingLarge Language ModelsMCPMLOpsMistralOpenAIPythonPython Web FrameworkQwenReal-Time / StreamingReasoning ModelsRerankingSpeculative DecodingStructured OutputTool UsevLLM

Recent Activity

Updated 1 months ago

7 Days

0

30 Days

0

90 Days

20

fix(vlm): route OCR processors around torch-gated AutoImageProcessor

jundot • May 11, 2026

a1987ed

observability(speculative): log per-request vlm_mtp acceptance stats

jundot • May 11, 2026

a7b2082

ui(speculative): tighten vlm_mtp toggle UX

jundot • May 11, 2026

70734e1

Quality

Quality signals are not available for this repo yet.

Categories

Foundation ModelsPrimaryAI AgentsRAG & RetrievalEvals & BenchmarkingInference & ServingML Platform & InfrastructureOther AI / MLMLOps & InfrastructureDev Tools & Automation

PM Skills

Cost & EfficiencyData & EvaluationDeveloper PlatformProduct DiscoveryScale & Reliability

Languages

Python100.0%

Timeline

Project created
—
Forked
May 11, 2026
Your last push
1 months ago
Upstream last push
1 months ago
Tracked since
May 11, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…