haoheliu/AudioLDM2

AudioLDM2

Text-to-Audio/Music Generation

Builder

haoheliu • individual

Stars

2,609

Using upstream star count

Forks

209

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Aug 4, 2023

Project creation date

README Summary

AudioLDM2 is a text-to-audio and text-to-music generation model that uses latent diffusion to synthesize audio from text descriptions. The repository provides pre-trained models and code for generating sound effects, music, and speech from natural-language prompts. According to the README, it improves on the original AudioLDM in audio quality and in the diversity of audio it can generate.

AI Dev Skills

Unmapped

Latent Diffusion Models, Text-to-Audio Synthesis, Audio Signal Processing, Multimodal Deep Learning, Generative AI, Audio Feature Extraction, Neural Audio Synthesis, Cross-modal Learning

Tags

Latent Diffusion Models, Text-to-Audio Synthesis, Audio Signal Processing, Multimodal Deep Learning, Generative AI, Audio Feature Extraction, Neural Audio Synthesis, Cross-modal Learning, Music Production, Self-hosted, Text-to-Audio Generation, Audio, Automated Music Production, Sound Effect Generation, Multimodal AI, Advertising, Gaming, Diffusion Models, Audio Content Creation, Content Creation, Text, Multimodal, Media & Entertainment, Music Composition from Text, Cloud API, Python

Taxonomy

Recent Activity

Updated 1 year ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: medium
Maturity: research

Categories

Foundation Models (Primary), Multimodal AI, Other AI / ML, Generative Media, Robotics

PM Skills

Scale & Reliability

Languages

Python 100.0%

Timeline

Project created: Aug 4, 2023
Forked: Mar 23, 2026
Your last push: 1 year ago
Upstream last push: 1 year ago
Tracked since: Sep 29, 2024
