Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/espnet
Library/espnetForked

espnet/espnet

espnet

End-to-End Speech Processing Toolkit

View on GitHub↗Upstream espnet/espnet↗

Builder

espnet

espnet

espnet • individual

Stars

9,846

Using upstream star count

Forks

2,405

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Dec 13, 2017

Project creation date

README Summary

<div align="left"><img src="doc/image/espnet_logo1.png" width="550"/></div>

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Attention MechanismsAudio Signal ProcessingAutomatic Speech RecognitionConnectionist Temporal ClassificationDeep Learning for SpeechEnd-to-End LearningMulti-task LearningNeural Network Architecture DesignNeural VocodingSequence-to-Sequence ModelingSpeech EnhancementSpeech TranslationText-to-Speech SynthesisTransformer Architecture

Tags

Attention MechanismsAudio Signal ProcessingAutomatic Speech RecognitionConnectionist Temporal ClassificationDeep Learning for SpeechEnd-to-End LearningMulti-task LearningNeural Network Architecture DesignNeural VocodingSequence-to-Sequence ModelingSpeech EnhancementSpeech TranslationText-to-Speech SynthesisTransformer ArchitectureAI SafetyBenchmarkingCLI ToolCourseDeep LearningDeepSpeedDistillationDockerEmbeddingsEvalsForkedHuggingFaceJupyterLarge Language ModelsMusic TechOpen SourceOpenAIPyTorchPythonReal-Time / StreamingResearch / PapersSpeech to TextText to SpeechTransformersTutorialWeights & Biases

Taxonomy

AI Trends

End-to-End LearningMultimodal AISelf-Supervised LearningNeural Audio ProcessingReal-time AI Processing

category

Foundation ModelsRAG & RetrievalModel TrainingEvals & BenchmarkingObservability & MonitoringInference & ServingGenerative MediaMLOps & InfrastructureDev Tools & AutomationLearning ResourcesIndustry: Audio & MusicSecurity & SafetyData Science & Analytics

Deployment Context

Self-hostedCloud APIOn-premiseEdge/Mobile

Industries

TelecommunicationsMedia and EntertainmentAssistive TechnologyEducationHealthcareCustomer ServiceAutomotive

Modalities

AudioTextSpeech

Skill Areas

Automatic Speech RecognitionText-to-Speech SynthesisSpeech TranslationSpeech EnhancementNeural Network Architecture DesignSequence-to-Sequence ModelingAttention MechanismsTransformer ArchitectureConnectionist Temporal ClassificationAudio Signal ProcessingDeep Learning for SpeechEnd-to-End LearningMulti-task LearningNeural Vocoding

tag

AI SafetyBenchmarkingCLI ToolCourseDeep LearningDeepSpeedDistillationDockerEmbeddingsEvalsForkedHuggingFaceJupyterLarge Language ModelsMusic TechOpen SourceOpenAIPyTorchPythonReal-Time / StreamingResearch / PapersSpeech to TextText to SpeechTransformersTutorialWeights & Biases

Use Cases

Automatic Speech RecognitionVoice Assistant DevelopmentSpeech-to-Text TranscriptionText-to-Speech SynthesisReal-time Speech TranslationAudio Enhancement and DenoisingVoice ConversionMultilingual Speech ProcessingSpeaker RecognitionSpeech Emotion Recognition

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

20

Merge pull request #6390 from Masao-Someki/espnet3/integration_test_1

Shinji Watanabe • Mar 18, 2026

119bf0c

Merge branch 'espnet3/integration_test_1' of github.com:Masao-Someki/espnet into espnet3/integration

Masao-Someki • Mar 18, 2026

b1097d3

Added unit test for base metrics

Masao-Someki • Mar 18, 2026

4980690

Quality

production
Quality
high
Maturity
production

Categories

RAG & RetrievalPrimaryEvals & BenchmarkingObservability & MonitoringInference & ServingMLOps & InfrastructureDev Tools & AutomationLearning ResourcesIndustry: Audio & MusicSecurity & SafetyData Science & AnalyticsFoundation ModelsModel TrainingGenerative MediaSafety & AlignmentSearch & KnowledgeOther AI / ML

PM Skills

Safety & AlignmentUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper Platform

Languages

Python100.0%

Timeline

Project created
Dec 13, 2017
Forked
Mar 22, 2026
Your last push
2 months ago
Upstream last push
20 days ago
Tracked since
Mar 18, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…