vllm-project/vllm-omni

vllm-omni

A framework for efficient model inference with omni-modality models

Builder

vLLM

vllm-project • startup

Stars

4,841

Using upstream star count

Forks

1,018

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Sep 11, 2025

Project creation date

README Summary

Easy, fast, and cheap omni-modality model serving for everyone

AI Dev Skills

Unmapped

Taxonomy

AI Trends

Multimodal Reasoning, Large Language Models, Model Serving Infrastructure, GPU-Accelerated Inference

Recent Activity

Updated 2 months ago

[Bugfix] Restore chunk-waiting requests on OmniNewRequestData rewrap failure (#1691)

Du Bin • Mar 22, 2026

b4a96b0

[CI] Add Flux2 Klein Tests (#2027)

Alex Brooks • Mar 22, 2026

a5574a2

[FP8] enable hunyuan-image-3 diffusion model with fp8 online quant (#1935)

Chendi.Xue • Mar 22, 2026

28aee51

Quality

prototype

Quality: medium
Maturity: prototype

PM Skills

Cost & Efficiency, Safety & Alignment, User Experience, Scale & Reliability, Data & Evaluation, AI-Native Architecture

Languages

Python 100.0%

Timeline

Project created: Sep 11, 2025
Forked: Mar 22, 2026
Your last push: 2 months ago
Upstream last push: 16 days ago
Tracked since: Mar 22, 2026


