Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/BentoML
Library/BentoMLForked

bentoml/BentoML

BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

View on GitHub↗Upstream bentoml/BentoML↗

Builder

bentoml

bentoml

bentoml • individual

Stars

8,659

Using upstream star count

Forks

968

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 2, 2019

Project creation date

README Summary

<picture> <source media="(prefers-color-scheme: dark)" srcset="https://github.com/bentoml/BentoML/assets/489344/d3e6c95d-d224-49a5-9cff-0789f094e127"> <source media="(prefers-color-scheme: light)" srcset="https://github.com/bentoml/BentoML/assets/489344/de4da660-6aeb-4e5a-bf76-b7177435444d"> <img alt="BentoML: Unified Model Serving Framework" src="https://github.com/bentoml/BentoML/assets/489344/de4da660-6aeb-4e5a-bf76-b7177435444d" width="370" style="max-width: 100%;"> </picture>

Community Evaluation

Loading…

AI Dev Skills

Unmapped

API Development for MLContainerized ML DeploymentDistributed Model ServingLarge Language Model DeploymentModel Inference OptimizationModel Serving InfrastructureMulti-model Pipeline ArchitectureProduction ML Systems

Tags

API Development for MLContainerized ML DeploymentDistributed Model ServingLarge Language Model DeploymentModel Inference OptimizationModel Serving InfrastructureMulti-model Pipeline ArchitectureProduction ML SystemsAPIBatchingComputer VisionControlNetCrewAIDeepSeekDockerEmbeddingsForkedGPU / CUDAImage GenerationLLM ServingLangGraphLlamaMLOpsMachine LearningMistralPyTorchPythonRoadmapStable DiffusionTool UseTransformersTutorialVideo Generation

Taxonomy

AI Trends

Large Language ModelsMulti-model SystemsProduction ML OperationsModel-as-a-ServiceCompound AI Systems

category

Generative MediaFoundation ModelsAI AgentsRAG & RetrievalModel TrainingInference & ServingComputer VisionMLOps & InfrastructureDev Tools & AutomationLearning Resources

Deployment Context

Cloud APISelf-hostedOn-premiseContainerized DeploymentKubernetesServerless

Modalities

TextImageAudioVideoTabularMultimodal

Skill Areas

Model Serving InfrastructureAPI Development for MLMulti-model Pipeline ArchitectureLarge Language Model DeploymentModel Inference OptimizationProduction ML SystemsContainerized ML DeploymentDistributed Model Serving

tag

APIBatchingComputer VisionControlNetCrewAIDeepSeekDockerEmbeddingsForkedGPU / CUDAImage GenerationLLM ServingLangGraphLlamaMLOpsMachine LearningMistralPyTorchPythonRoadmapStable DiffusionTool UseTransformersTutorialVideo Generation

Use Cases

Model Inference API DevelopmentLarge Language Model Application ServingMulti-model Pipeline DeploymentBatch Job Processing for MLReal-time Model Prediction ServicesML Model Versioning and Management

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

2

docs: Update slack to forum link (#5567)

Sherlock Xu • Mar 13, 2026

cbce0f8

feat: add workflow_dispatch input for manual release triggers

Frost Ming • Mar 6, 2026

e1e0f7e

ci: pre-commit autoupdate [skip ci] (#5562)

pre-commit-ci[bot] • Mar 3, 2026

0a8d532

Quality

production
Quality
high
Maturity
production

Categories

Foundation ModelsPrimaryAI AgentsRAG & RetrievalModel TrainingInference & ServingGenerative MediaComputer VisionML Platform & InfrastructureCoding & Dev ToolsOther AI / MLMLOps & InfrastructureDev Tools & AutomationLearning Resources

PM Skills

Cost & EfficiencyScale & ReliabilityProduct DiscoveryDeveloper PlatformAI-Native Architecture

Languages

Python100.0%

Timeline

Project created
Apr 2, 2019
Forked
Mar 22, 2026
Your last push
2 months ago
Upstream last push
27 days ago
Tracked since
Mar 16, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…