Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/dia
Library/diaForked

nari-labs/dia

dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

View on GitHub↗Upstream nari-labs/dia↗

Builder

nari-labs

nari-labs

nari-labs • individual

Stars

19,299

Using upstream star count

Forks

1,684

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 19, 2025

Project creation date

README Summary

<p align="center"> <a href="https://github.com/nari-labs/dia"> <img src="./dia/static/images/banner.png"> </a> </p> <p align="center"> <a href="https://tally.so/r/meokbo" target="_blank"><img alt="Static Badge" src="https://img.shields.io/badge/Join-Waitlist-white?style=for-the-badge"></a> <a href="https://discord.gg/bJq6vjRRKv" target="_blank"><img src="https://img.shields.io/badge/Discord-Join%20Chat-7289DA?logo=discord&style=for-the-badge"></a> <a href="https://github.com/nari-labs/dia/blob/m

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Audio Signal ProcessingDeep Learning Model DevelopmentDialogue GenerationNeural Audio GenerationSpeech Synthesis ArchitectureText-to-Speech SynthesisVoice Cloning

Tags

Audio Signal ProcessingDeep Learning Model DevelopmentDialogue GenerationNeural Audio GenerationSpeech Synthesis ArchitectureText-to-Speech SynthesisVoice CloningDockerForkedGPU / CUDAHuggingFacePyTorchPythonQuantizationResearch / PapersText to SpeechTransformers

Taxonomy

AI Trends

Generative AIReal-time AIVoice AISingle-pass Generation

category

Foundation ModelsModel TrainingInference & ServingGenerative MediaMLOps & InfrastructureLearning Resources

Deployment Context

Self-hostedCloud APIOn-premise

Industries

EntertainmentGamingEducationAccessibility TechnologyMedia Production

Modalities

TextAudio

Skill Areas

Text-to-Speech SynthesisNeural Audio GenerationDeep Learning Model DevelopmentSpeech Synthesis ArchitectureAudio Signal ProcessingDialogue GenerationVoice Cloning

tag

DockerForkedGPU / CUDAHuggingFacePyTorchPythonQuantizationResearch / PapersText to SpeechTransformers

Use Cases

Conversational AI Voice GenerationAudiobook ProductionVoice DubbingInteractive Dialogue SystemsAccessibility Voice SynthesisGaming Character Voices

Recent Activity

Updated 6 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

prototype
Quality
medium
Maturity
prototype

Categories

Inference & ServingPrimaryMLOps & InfrastructureLearning ResourcesFoundation ModelsModel TrainingGenerative MediaSearch & Knowledge

PM Skills

Cost & EfficiencyUser ExperienceScale & Reliability

Languages

Python100.0%

Timeline

Project created
Apr 19, 2025
Forked
Mar 23, 2026
Your last push
6 months ago
Upstream last push
6 months ago
Tracked since
Nov 19, 2025

Similar Repos

pgvector cosine similarity · $0

Loading…