Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/omlx
Library/omlxForked

jundot/omlx

omlx

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

View on GitHub↗Upstream jundot/omlx↗

Builder

jundot

jundot

jundot • individual

Stars

13,846

Using upstream star count

Forks

1,177

Using upstream fork count

Open Issues

0

Activity Score

0/100

20 commits in 30d

Created

—

README Summary

<p align="center"> <picture> <source media="(prefers-color-scheme: dark)" srcset="docs/images/icon-rounded-dark.svg" width="140"> <source media="(prefers-color-scheme: light)" srcset="docs/images/icon-rounded-light.svg" width="140"> <img alt="oMLX" src="docs/images/icon-rounded-light.svg" width="140"> </picture> </p>

Community Evaluation

Loading…

AI Dev Skills

No AI dev skills recorded.

Tags

ActiveAnthropic / ClaudeBatchingBenchmarkingCachingClaudeClaude CodeDeepSeekEmbeddingsEvalsForkedGemmaHuggingFaceKV CacheLLM ServingLarge Language ModelsMCPMLOpsMistralOpenAIPythonPython Web FrameworkQwenReal-Time / StreamingReasoning ModelsRerankingSpeculative DecodingStructured OutputTool UsevLLM

Taxonomy

category

Inference & ServingFoundation ModelsAI AgentsRAG & RetrievalEvals & BenchmarkingMLOps & InfrastructureDev Tools & Automation

tag

ActiveAnthropic / ClaudeBatchingBenchmarkingCachingClaudeClaude CodeDeepSeekEmbeddingsEvalsForkedGemmaHuggingFaceKV CacheLLM ServingLarge Language ModelsMCPMLOpsMistralOpenAIPythonPython Web FrameworkQwenReal-Time / StreamingReasoning ModelsRerankingSpeculative DecodingStructured OutputTool UsevLLM

Recent Activity

Updated 3 days ago

7 Days

20

30 Days

20

90 Days

20

fix(vlm): route OCR processors around torch-gated AutoImageProcessor

jundot • May 11, 2026

a1987ed

observability(speculative): log per-request vlm_mtp acceptance stats

jundot • May 11, 2026

a7b2082

ui(speculative): tighten vlm_mtp toggle UX

jundot • May 11, 2026

70734e1

Quality

Quality signals are not available for this repo yet.

Categories

Inference & ServingPrimaryFoundation ModelsAI AgentsRAG & RetrievalEvals & BenchmarkingMLOps & InfrastructureDev Tools & Automation

PM Skills

Cost & EfficiencyScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper Platform

Languages

Python100.0%

Timeline

Project created
—
Forked
May 11, 2026
Your last push
3 days ago
Upstream last push
3 days ago
Tracked since
May 11, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…