Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/AudioLDM2
Library/AudioLDM2Forked

haoheliu/AudioLDM2

AudioLDM2

Text-to-Audio/Music Generation

View on GitHub↗Upstream haoheliu/AudioLDM2↗

Builder

haoheliu

haoheliu

haoheliu • individual

Stars

2,629

Using upstream star count

Forks

208

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Aug 4, 2023

Project creation date

README Summary

[![arXiv](https://img.shields.io/badge/arXiv-2308.05734-brightgreen.svg?style=flat-square)](https://arxiv.org/abs/2308.05734) [![githubio](https://img.shields.io/badge/GitHub.io-Audio_Samples-blue?logo=Github&style=flat-square)](https://audioldm.github.io/audioldm2/) [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/haoheliu/audioldm2-text2audio-text2music)

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Audio Feature ExtractionAudio Signal ProcessingCross-modal LearningGenerative AILatent Diffusion ModelsMultimodal Deep LearningNeural Audio SynthesisText-to-Audio Synthesis

Tags

Audio Feature ExtractionAudio Signal ProcessingCross-modal LearningGenerative AILatent Diffusion ModelsMultimodal Deep LearningNeural Audio SynthesisText-to-Audio SynthesisFine-TuningForkedGPU / CUDAHuggingFaceJupyterMachine LearningMusic / Audio AIMusic TechPrompt EngineeringPythonPyTorchResearch / PapersSpeech to TextText to SpeechTransformers

Taxonomy

AI Trends

Generative AIMultimodal AIDiffusion Models

category

Generative MediaFoundation ModelsAI AgentsModel TrainingInference & ServingLearning ResourcesIndustry: Audio & MusicData Science & Analytics

Deployment Context

Self-hostedCloud API

Industries

Media & EntertainmentMusic ProductionGamingAdvertisingContent Creation

Modalities

TextAudioMultimodal

Skill Areas

Latent Diffusion ModelsText-to-Audio SynthesisAudio Signal ProcessingMultimodal Deep LearningGenerative AIAudio Feature ExtractionNeural Audio SynthesisCross-modal Learning

tag

Fine-TuningForkedGPU / CUDAHuggingFaceJupyterMachine LearningMusic / Audio AIMusic TechPrompt EngineeringPyTorchPythonResearch / PapersSpeech to TextText to SpeechTransformers

Use Cases

Text-to-Audio GenerationMusic Composition from TextSound Effect GenerationAudio Content CreationAutomated Music Production

Recent Activity

Updated 1 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
medium
Maturity
research

Categories

Inference & ServingPrimaryLearning ResourcesIndustry: Audio & MusicData Science & AnalyticsFoundation ModelsAI AgentsModel TrainingGenerative MediaSearch & KnowledgeOther AI / ML

PM Skills

User Experience

Languages

Python100.0%

Timeline

Project created
Aug 4, 2023
Forked
Mar 23, 2026
Your last push
1 years ago
Upstream last push
1 years ago
Tracked since
Sep 29, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…