Library/speechbrain
Library/speechbrainForked

speechbrain/speechbrain

speechbrain

A PyTorch-based Speech Toolkit

Builder

speechbrain

speechbrain

speechbrain • individual

Stars

11,384

Using upstream star count

Forks

1,675

Using upstream fork count

Open Issues

0

Activity Score

0/100

4 commits in 30d

Created

Apr 28, 2020

Project creation date

README Summary

SpeechBrain is a comprehensive PyTorch-based toolkit for speech and audio processing that provides pre-trained models and recipes for various tasks including speech recognition, speaker recognition, speech enhancement, and text-to-speech. The toolkit is designed to be user-friendly with simple APIs while maintaining flexibility for research and production use. It offers extensive documentation, tutorials, and a modular architecture that supports both beginners and advanced users.

AI Dev Skills

Unmapped

Automatic Speech RecognitionSpeaker Recognition and VerificationSpeech Enhancement and SeparationText-to-Speech SynthesisSpeech Emotion RecognitionLanguage IdentificationVoice Activity DetectionAudio Signal ProcessingDeep Learning for AudioTransformer Architecture for SpeechRecurrent Neural NetworksConvolutional Neural Networks for AudioSelf-Supervised Learning for SpeechMulti-task LearningTransfer Learning for Speech

Tags

Automatic Speech RecognitionSpeaker Recognition and VerificationSpeech Enhancement and SeparationText-to-Speech SynthesisSpeech Emotion RecognitionLanguage IdentificationVoice Activity DetectionAudio Signal ProcessingDeep Learning for AudioTransformer Architecture for SpeechRecurrent Neural NetworksConvolutional Neural Networks for AudioSelf-Supervised Learning for SpeechMulti-task LearningTransfer Learning for SpeechReal-time Voice TranscriptionAudio Book NarrationCloud APIVoice Command RecognitionTelecommunicationsMedia and EntertainmentVoice Biometric AuthenticationAudio Content AnalysisFoundation Models for SpeechHealthcareSpeech-to-Text ConversionAccessibility TechnologySecurity and SurveillanceOn-device AIMulti-speaker Audio SeparationAudio Quality EnhancementSelf-Supervised LearningSelf-hostedEducationVoice Assistant DevelopmentOn-premiseCustomer ServiceAutomated CaptioningEdge/MobileEdge AIMultimodal AICall Center AnalyticsAudioPython

Taxonomy

Recent Activity

Updated 1 months ago

7 Days

0

30 Days

4

90 Days

24

Quality

production
Quality
high
Maturity
production

Categories

Dev Tools & AutomationPrimaryInference & ServingCoding & Dev ToolsData Science & AnalyticsHealthcare & BiologyMultimodal AIEdge & Mobile AIOther AI / MLFoundation ModelsGenerative Media

PM Skills

Developer Platform

Languages

Python100.0%

Timeline

Project created
Apr 28, 2020
Forked
Mar 22, 2026
Your last push
1 months ago
Upstream last push
10 days ago
Tracked since
Mar 1, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…