Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/TTS
Library/TTSForked

coqui-ai/TTS

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

View on GitHub↗Upstream coqui-ai/TTS↗

Builder

coqui-ai

coqui-ai

coqui-ai • individual

Stars

45,432

Using upstream star count

Forks

6,096

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 20, 2020

Project creation date

README Summary

🐸Coqui.ai News - 📣 ⓍTTSv2 is here with 16 languages and better performance across the board. - 📣 ⓍTTS fine-tuning code is out. Check the [example recipes](https://github.com/coqui-ai/TTS/tree/dev/recipes/ljspeech). - 📣 ⓍTTS can now stream with <200ms latency. - 📣 ⓍTTS, our production TTS model that can speak 13 languages, is released [Blog Post](https://coqui.ai/blog/tts/open_xtts), [Demo](https://huggingface.co/spaces/coqui/xtts), [Docs](https://tts.readthedocs.io/en/dev/models/xtts.html) - 📣 [

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Attention MechanismsAudio Signal ProcessingAutoregressive Language ModelingConvolutional Neural NetworksDeep Generative ModelsMel-spectrogram GenerationMulti-speaker ModelingNeural Vocoder ImplementationSequence-to-Sequence LearningTacotron ArchitectureText-to-Speech SynthesisTransformer ArchitectureVoice CloningWaveNet Implementation

Tags

Attention MechanismsAudio Signal ProcessingAutoregressive Language ModelingConvolutional Neural NetworksDeep Generative ModelsMel-spectrogram GenerationMulti-speaker ModelingNeural Vocoder ImplementationSequence-to-Sequence LearningTacotron ArchitectureText-to-Speech SynthesisTransformer ArchitectureVoice CloningWaveNet ImplementationAI SafetyCLI ToolData ScienceDeep LearningDockerEmbeddingsEvalsForkedGPU / CUDAHuggingFacePyTorchPythonResearch / PapersText to Speech

Taxonomy

AI Trends

Generative AIFoundation ModelsZero-shot LearningOn-device AIMultimodal AI

category

Foundation ModelsRAG & RetrievalModel TrainingEvals & BenchmarkingInference & ServingGenerative MediaMLOps & InfrastructureDev Tools & AutomationLearning ResourcesIndustry: Audio & MusicSecurity & SafetyData Science & Analytics

Deployment Context

Self-hostedCloud APIEdge/MobileOn-premise

Industries

Media & EntertainmentEducationAccessibility TechnologyGamingAudiobook ProductionVoice Assistant DevelopmentContent Creation

Modalities

TextAudio

Skill Areas

Text-to-Speech SynthesisNeural Vocoder ImplementationAutoregressive Language ModelingAttention MechanismsMel-spectrogram GenerationVoice CloningMulti-speaker ModelingAudio Signal ProcessingDeep Generative ModelsSequence-to-Sequence LearningTransformer ArchitectureConvolutional Neural NetworksWaveNet ImplementationTacotron Architecture

tag

AI SafetyCLI ToolData ScienceDeep LearningDockerEmbeddingsEvalsForkedGPU / CUDAHuggingFacePyTorchPythonResearch / PapersText to SpeechVoice Cloning

Use Cases

Audiobook NarrationVoice Assistant ResponsesPodcast GenerationLanguage Learning ApplicationsAccessibility Screen ReadingVideo Game Character VoicesInteractive Voice Response SystemsNews Article Audio ConversionCustom Voice CreationMultilingual Speech Synthesis

Recent Activity

Updated 1 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

Foundation ModelsPrimaryRAG & RetrievalModel TrainingEvals & BenchmarkingInference & ServingGenerative MediaSafety & AlignmentData Science & AnalyticsSearch & KnowledgeOther AI / MLMLOps & InfrastructureDev Tools & AutomationLearning ResourcesIndustry: Audio & MusicSecurity & Safety

PM Skills

Safety & AlignmentUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper Platform

Languages

Python100.0%

Timeline

Project created
May 20, 2020
Forked
Mar 22, 2026
Your last push
1 years ago
Upstream last push
1 years ago
Tracked since
Aug 16, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…