Library/OpenLLM
Library/OpenLLMForked

bentoml/OpenLLM

OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Builder

bentoml

bentoml

bentoml • individual

Stars

12,270

Using upstream star count

Forks

805

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 19, 2023

Project creation date

README Summary

OpenLLM is a platform that allows you to run any open-source large language models (LLMs) like DeepSeek, Llama, and others as OpenAI-compatible API endpoints in the cloud. It provides easy deployment and serving capabilities for various open-source LLMs with OpenAI API compatibility. The platform is built on BentoML and supports running models locally or in cloud environments.

AI Dev Skills

Unmapped

Large Language Model DeploymentAPI Gateway DesignModel Serving InfrastructureCloud Computing ArchitectureContainerizationDistributed SystemsREST API DevelopmentModel Inference Optimization

Tags

Large Language Model DeploymentAPI Gateway DesignModel Serving InfrastructureCloud Computing ArchitectureContainerizationDistributed SystemsREST API DevelopmentModel Inference OptimizationCloud-based Language Model HostingLLM API Service DeploymentOn-premiseMulti-model Serving PlatformTextSelf-hostedCloud APISelf-hosted AIOpen Source LLMsOpenAI API Drop-in ReplacementContainer OrchestrationModel Inference as a ServiceAPI StandardizationRESTful API DevelopmentPython

Taxonomy

Recent Activity

Updated 28 days ago

7 Days

0

30 Days

0

90 Days

0

Quality

beta
Quality
medium
Maturity
beta

Categories

Foundation ModelsPrimaryLearning ResourcesInference & ServingML Platform & InfrastructureOther AI / ML

PM Skills

Product Discovery

Languages

Python100.0%

Timeline

Project created
Apr 19, 2023
Forked
Mar 22, 2026
Your last push
28 days ago
Upstream last push
7 days ago
Tracked since
Mar 16, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…