Reporium

microsoft/VibeVoice

VibeVoice

Open-Source Frontier Voice AI

Upstream: microsoft/VibeVoice

Builder

Microsoft

microsoft • big-tech

Stars

47,561

Using upstream star count

Forks

5,308

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Aug 25, 2025

Project creation date

README Summary

🎙️ VibeVoice: Open-Source Frontier Voice AI

Project Page: https://microsoft.github.io/VibeVoice
Hugging Face Collection: https://huggingface.co/collections/microsoft/vibevoice-68a2ef24a875c44be47b034f
TTS Report: https://arxiv.org/pdf/2508.19205
ASR Report: …

AI Dev Skills

Unmapped

Audio Processing, Audio Signal Processing, Neural Voice Generation, Speech Recognition, Speech-to-Text, Text-to-Speech, Voice AI Model Training, Voice Synthesis

Tags

Audio Processing, Audio Signal Processing, Neural Voice Generation, Speech Recognition, Speech-to-Text, Text-to-Speech, Voice AI Model Training, Voice Synthesis, Fine-Tuning, Forked, HuggingFace, Jupyter, LLM Serving, Large Language Models, Qwen, Real-Time / Streaming, Research / Papers, Speech to Text, Structured Output, Text to Speech, Transformers, vLLM

Taxonomy

AI Trends

Open Source AI, Voice AI, Generative AI, Frontier AI Models

Category

Foundation Models, AI Agents, Model Training, Inference & Serving, Generative Media, Learning Resources, Data Science & Analytics

Deployment Context

Self-hosted, Cloud API, On-premise

Industries

Media & Entertainment, Telecommunications, Accessibility Technology, Gaming, Education, Customer Service

Modalities

Audio, Text

Skill Areas

Voice Synthesis, Speech Recognition, Audio Processing, Neural Voice Generation, Speech-to-Text, Text-to-Speech, Voice AI Model Training, Audio Signal Processing

Tag

Fine-Tuning, Forked, HuggingFace, Jupyter, LLM Serving, Large Language Models, Qwen, Real-Time / Streaming, Research / Papers, Speech to Text, Structured Output, Text to Speech, Transformers, vLLM

Use Cases

Voice Assistant Development, Real-time Voice Synthesis, Speech Recognition Systems, Voice Cloning, Accessibility Voice Tools, Interactive Voice Applications

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

2

Merge pull request #255 from sd983527/main

Zhiliang Peng • Mar 6, 2026

4c41997

Add news about VibeVoice ASR Transformers integration

Yan Xia • Mar 6, 2026

7e73bee

Merge pull request #247 from Damon-Salvetore/fix/vllm-version-compat

Li Dong • Feb 28, 2026

7ef9dbe

Quality

Quality: medium
Maturity: prototype

Categories

Inference & Serving (Primary), Learning Resources, Data Science & Analytics, Foundation Models, AI Agents, Model Training, Generative Media, Search & Knowledge

PM Skills

Cost & Efficiency, User Experience, Scale & Reliability, Developer Platform

Languages

Python 100.0%

Timeline

Project created: Aug 25, 2025
Forked: Mar 13, 2026
Your last push: 2 months ago
Upstream last push: 28 days ago
Tracked since: Mar 6, 2026

Similar Repos

pgvector cosine similarity · $0