Library/inferenceForked

xorbitsai/inference

inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

View on GitHub↗Upstream xorbitsai/inference↗

Builder

xorbitsai

xorbitsai • individual

Stars

9,434

Using upstream star count

Forks

844

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Jun 14, 2023

Project creation date

README Summary

Community Evaluation

Loading…

AI Dev Skills

Unmapped

API Gateway DesignDistributed Model InferenceLarge Language Model DeploymentModel Serving InfrastructureMultimodal AI SystemsMulti-Model OrchestrationProduction MLOpsSpeech-to-Text Integration

Taxonomy

AI Trends

Model Interoperability Open Source LLMs Unified AI Interfaces On-premise AI Hybrid Cloud AI

Recent Activity

Updated 3 months ago

7 Days

30 Days

90 Days

ENH: update models JSON [llm] (#4710)

XprobeBot • Mar 21, 2026

8b97828

fix(qwen3.5): support tool calls (#4709)

llyycchhee • Mar 21, 2026

1e55151

ENH: update model "qwen3.5" JSON (#4707)

llyycchhee • Mar 21, 2026

a6b1345

Quality

production

Quality: high
Maturity: production

PM Skills

Cost & EfficiencyUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper PlatformAI-Native Architecture

Languages

Python100.0%

Timeline

Project created: Jun 14, 2023
Forked: Mar 22, 2026
Your last push: 3 months ago
Upstream last push: 2 months ago
Tracked since: Mar 21, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…

Library/inferenceForked

xorbitsai/inference

inference

View on GitHub↗Upstream xorbitsai/inference↗

Builder

xorbitsai

xorbitsai • individual

Stars

9,434

Using upstream star count

Forks

844

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Jun 14, 2023

Project creation date

README Summary

Community Evaluation

Loading…

AI Dev Skills

Unmapped

API Gateway DesignDistributed Model InferenceLarge Language Model DeploymentModel Serving InfrastructureMultimodal AI SystemsMulti-Model OrchestrationProduction MLOpsSpeech-to-Text Integration

Taxonomy

AI Trends

Model Interoperability Open Source LLMs Unified AI Interfaces On-premise AI Hybrid Cloud AI

Recent Activity

Updated 3 months ago

7 Days

30 Days

90 Days

ENH: update models JSON [llm] (#4710)

XprobeBot • Mar 21, 2026

8b97828

fix(qwen3.5): support tool calls (#4709)

llyycchhee • Mar 21, 2026

1e55151

ENH: update model "qwen3.5" JSON (#4707)

llyycchhee • Mar 21, 2026

a6b1345

Quality

production

Quality: high
Maturity: production

PM Skills

Cost & EfficiencyUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper PlatformAI-Native Architecture

Languages

Python100.0%

Timeline

Project created: Jun 14, 2023
Forked: Mar 22, 2026
Your last push: 3 months ago
Upstream last push: 2 months ago
Tracked since: Mar 21, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…

inference

README Summary

Community Evaluation

AI Dev Skills

Tags

Taxonomy

Recent Activity

Quality

Categories

PM Skills

Languages

Timeline

Similar Repos

inference

README Summary

Community Evaluation

AI Dev Skills

Tags

Taxonomy

Recent Activity

Quality

Categories

PM Skills

Languages

Timeline

Similar Repos