Reporium

Blaizzy/mlx-audio

mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Upstream: Blaizzy/mlx-audio

Builder

Blaizzy • individual

Stars

7,141

Using upstream star count

Forks

612

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Nov 27, 2024

Project creation date

README Summary

The best audio processing library built on Apple's MLX framework, providing fast and efficient text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon.

AI Dev Skills

Unmapped

Apple MLX Framework · Audio Signal Processing · Automatic Speech Recognition · Neural Vocoding · On-device Machine Learning · Speech Synthesis · Speech-to-Speech Translation · Voice Conversion

Tags

Apple MLX Framework · Audio Signal Processing · Automatic Speech Recognition · Neural Vocoding · On-device Machine Learning · Speech Synthesis · Speech-to-Speech Translation · Voice Conversion · AI Safety · API · Backend · Forked · Healthcare AI · HuggingFace · Mistral · Mobile · Multimodal AI · Music Tech · OpenAI · PyTorch · Python · Quantization · Real-Time / Streaming · Speech to Text · Text to Speech · Transformers

Taxonomy

AI Trends

On-device AI · Edge Computing · Apple Silicon Optimization · Real-time Speech Processing

Category

Foundation Models · Model Training · Inference & Serving · Generative Media · Dev Tools & Automation · Industry: Healthcare · Industry: Audio & Music · Security & Safety

Deployment Context

Edge/Mobile · On-premise

Industries

Media & Entertainment · Accessibility Technology · Mobile App Development · Education · Healthcare

Modalities

Audio · Text

Skill Areas

Speech Synthesis · Automatic Speech Recognition · Voice Conversion · Neural Vocoding · Apple MLX Framework · On-device Machine Learning · Audio Signal Processing · Speech-to-Speech Translation

Tag

AI Safety · API · Backend · Forked · Healthcare AI · HuggingFace · Mistral · Mobile · Multimodal AI · Music Tech · OpenAI · PyTorch · Python · Quantization · Real-Time / Streaming · Speech to Text · Text to Speech · Transformers

Use Cases

Voice Assistant Development · Real-time Speech Transcription · Text-to-Speech for Accessibility · Voice Cloning · Speech-to-Speech Translation · Audio Content Generation · Voice Interface Development

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

20

Merge pull request #569 from mm65x/add-moshi-sts

Lucas Newman • Mar 16, 2026

da79aaa

fix(moshi): remove nonexistent ConditionFuser import from modules __init__

mm65x • Mar 16, 2026

31eb403

style(moshi): fix black and isort formatting in modules

mm65x • Mar 16, 2026

1f57c1f

Quality

Quality: medium
Maturity: prototype

Categories

Inference & Serving (Primary) · Dev Tools & Automation · Industry: Healthcare · Industry: Audio & Music · Security & Safety · Foundation Models · Model Training · Generative Media · Safety & Alignment · Healthcare & Biology · Multimodal AI · Edge & Mobile AI · Other AI / ML

PM Skills

Cost & Efficiency · Safety & Alignment · User Experience · Scale & Reliability · Developer Platform

Languages

Python 100.0%

Timeline

Project created: Nov 27, 2024
Forked: Mar 16, 2026
Your last push: 2 months ago
Upstream last push: 20 days ago
Tracked since: Mar 17, 2026

Similar Repos

pgvector cosine similarity · $0
