Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/speechbrain
Library/speechbrainForked

speechbrain/speechbrain

speechbrain

A PyTorch-based Speech Toolkit

View on GitHub↗Upstream speechbrain/speechbrain↗

Builder

speechbrain

speechbrain

speechbrain • individual

Stars

11,575

Using upstream star count

Forks

1,690

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 28, 2020

Project creation date

README Summary

<p align="center"> <img src="https://raw.githubusercontent.com/speechbrain/speechbrain/develop/docs/images/speechbrain-logo.svg" alt="SpeechBrain Logo"/> </p>

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Audio Signal ProcessingAutomatic Speech RecognitionConvolutional Neural Networks for AudioDeep Learning for AudioLanguage IdentificationMulti-task LearningRecurrent Neural NetworksSelf-Supervised Learning for SpeechSpeaker Recognition and VerificationSpeech Emotion RecognitionSpeech Enhancement and SeparationText-to-Speech SynthesisTransfer Learning for SpeechTransformer Architecture for SpeechVoice Activity Detection

Tags

Audio Signal ProcessingAutomatic Speech RecognitionConvolutional Neural Networks for AudioDeep Learning for AudioLanguage IdentificationMulti-task LearningRecurrent Neural NetworksSelf-Supervised Learning for SpeechSpeaker Recognition and VerificationSpeech Emotion RecognitionSpeech Enhancement and SeparationText-to-Speech SynthesisTransfer Learning for SpeechTransformer Architecture for SpeechVoice Activity DetectionAI SafetyBatchingCourseDeep LearningEvalsForkedHuggingFaceMachine LearningMultimodal AIOpenAIPyTorchPythonReal-Time / StreamingResearch / PapersSpeech to TextTransformersTutorial

Taxonomy

AI Trends

Self-Supervised LearningFoundation Models for SpeechOn-device AIMultimodal AIEdge AI

category

Foundation ModelsModel TrainingEvals & BenchmarkingInference & ServingGenerative MediaLearning ResourcesSecurity & Safety

Deployment Context

Self-hostedCloud APIOn-premiseEdge/Mobile

Industries

TelecommunicationsHealthcareEducationMedia and EntertainmentCustomer ServiceAccessibility TechnologySecurity and Surveillance

Modalities

Audio

Skill Areas

Automatic Speech RecognitionSpeaker Recognition and VerificationSpeech Enhancement and SeparationText-to-Speech SynthesisSpeech Emotion RecognitionLanguage IdentificationVoice Activity DetectionAudio Signal ProcessingDeep Learning for AudioTransformer Architecture for SpeechRecurrent Neural NetworksConvolutional Neural Networks for AudioSelf-Supervised Learning for SpeechMulti-task LearningTransfer Learning for Speech

tag

AI SafetyBatchingCourseDeep LearningEvalsForkedHuggingFaceMachine LearningMultimodal AIOpenAIPyTorchPythonReal-Time / StreamingResearch / PapersSpeech to TextTransformersTutorial

Use Cases

Real-time Voice TranscriptionVoice Assistant DevelopmentAudio Content AnalysisSpeech-to-Text ConversionVoice Biometric AuthenticationAudio Quality EnhancementMulti-speaker Audio SeparationAutomated CaptioningVoice Command RecognitionAudio Book NarrationCall Center Analytics

Recent Activity

Updated 3 months ago

7 Days

0

30 Days

0

90 Days

0

Adding SENSE models (#2998)

bouziane maryem • Mar 1, 2026

aca5e41

Fix author name typo: Abous-Rjeili → Abou-Rjeili (#3038)

Georges • Mar 1, 2026

3eb32f6

Quality

production
Quality
high
Maturity
production

Categories

Evals & BenchmarkingPrimaryInference & ServingLearning ResourcesSecurity & SafetyFoundation ModelsModel TrainingGenerative MediaSafety & AlignmentMultimodal AISearch & KnowledgeOther AI / ML

PM Skills

Safety & AlignmentUser ExperienceScale & ReliabilityData & Evaluation

Languages

Python100.0%

Timeline

Project created
Apr 28, 2020
Forked
Mar 22, 2026
Your last push
3 months ago
Upstream last push
21 days ago
Tracked since
Mar 1, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…