Reporium

LMCache/LMCache

LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Builder

LMCache • individual

Stars

8,339

Using upstream star count

Forks

1,193

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 28, 2024

Project creation date

README Summary

<div align="center">
<p align="center">
<img src="https://raw.githubusercontent.com/LMCache/LMCache/dev/asset/logo.png" width="720" alt="lmcache logo">
</p>
[![Docs](https://img.shields.io/badge/docs-live-brightgreen)](https://docs.lmcache.ai/)
[![PyPI](https://img.shields.io/pypi/v/lmcache)](https://pypi.org/project/lmcache/)
[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/lmcache)](https://pypi.org/project/lmcache/)
[![Unit Tests](https://badge.buildkite.com

AI Dev Skills

Unmapped

Attention Mechanisms, Distributed Systems, Key-Value Caching, LLM Inference Optimization, Memory Management, Performance Engineering, Transformer Architecture

Tags

Attention Mechanisms, Distributed Systems, Key-Value Caching, LLM Inference Optimization, Memory Management, Performance Engineering, Transformer Architecture, Caching, Forked, KV Cache, LLM Serving, Large Language Models, MLOps, PyTorch, Python, Real-Time / Streaming, Research / Papers, Roadmap, SGLang, Tutorial, vLLM

Taxonomy

AI Trends

LLM Optimization, Inference Efficiency, Cost-Effective AI

Category

Inference & Serving, Foundation Models, Model Training, MLOps & Infrastructure, Learning Resources

Deployment Context

Cloud API, Self-hosted, On-premise

Industries

Developer Tools, Cloud Computing

Modalities

Text

Skill Areas

Transformer Architecture, Key-Value Caching, LLM Inference Optimization, Distributed Systems, Memory Management, Attention Mechanisms, Performance Engineering

Tag

Caching, Forked, KV Cache, LLM Serving, Large Language Models, MLOps, PyTorch, Python, Real-Time / Streaming, Research / Papers, Roadmap, SGLang, Tutorial, vLLM

Use Cases

LLM Inference Acceleration, Cost Reduction for AI Applications, Latency Optimization for Chatbots, Efficient Model Serving

Recent Activity

Updated 7 months ago

7 Days: 0
30 Days: 0
90 Days: 0

Quality

Quality: medium
Maturity: prototype

Categories

Inference & Serving (Primary), MLOps & Infrastructure, Learning Resources, Foundation Models, Model Training, ML Platform & Infrastructure, Search & Knowledge, Other AI / ML

PM Skills

Cost & Efficiency, Scale & Reliability

Languages

Python: 100.0%

Timeline

Project created: May 28, 2024
Forked: Nov 2, 2025
Your last push: 7 months ago
Upstream last push: 16 days ago
Tracked since: Nov 2, 2025
