Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/AudioGPT
Library/AudioGPTForked

AIGC-Audio/AudioGPT

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

View on GitHub↗Upstream AIGC-Audio/AudioGPT↗

Builder

AIGC-Audio

AIGC-Audio

AIGC-Audio • individual

Stars

10,178

Using upstream star count

Forks

857

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Mar 16, 2023

Project creation date

README Summary

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Audio GenerationAudio ProcessingComputer VisionCross-Modal UnderstandingFoundation ModelsMultimodal AIMusic GenerationNeural Audio SynthesisSound Effect GenerationSpeech SynthesisTalking Head Video Generation

Tags

Audio GenerationAudio ProcessingComputer VisionCross-Modal UnderstandingFoundation ModelsMultimodal AIMusic GenerationNeural Audio SynthesisSound Effect GenerationSpeech SynthesisTalking Head Video GenerationForkedHuggingFaceImage GenerationLangChainMusic TechOpen SourceOpenAIResearch / PapersSpeech to TextStable DiffusionTransformers

Taxonomy

AI Trends

Foundation ModelsMultimodal ReasoningGenerative AIAudio AILarge Multimodal Models

category

Foundation ModelsAI AgentsGenerative MediaLearning ResourcesIndustry: Audio & Music

Deployment Context

Self-hostedCloud API

Industries

EntertainmentMedia ProductionEducationGamingMarketingAccessibility Technology

Modalities

AudioVideoSpeechMusicMultimodal

Skill Areas

Multimodal AIAudio GenerationSpeech SynthesisMusic GenerationSound Effect GenerationTalking Head Video GenerationFoundation ModelsCross-Modal UnderstandingAudio ProcessingComputer VisionNeural Audio Synthesis

tag

ForkedHuggingFaceImage GenerationLangChainMusic TechOpen SourceOpenAIResearch / PapersSpeech to TextStable DiffusionTransformers

Use Cases

Audio Content GenerationMusic CompositionSpeech SynthesisSound Effect CreationTalking Head Video GenerationAudio-Visual Content CreationVoice CloningMusic Production Assistance

Recent Activity

Updated 1 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
medium
Maturity
research

Categories

Learning ResourcesPrimaryIndustry: Audio & MusicFoundation ModelsAI AgentsGenerative MediaSearch & KnowledgeOther AI / ML

PM Skills

User Experience

Languages

Python100.0%

Timeline

Project created
Mar 16, 2023
Forked
Mar 23, 2026
Your last push
1 years ago
Upstream last push
1 years ago
Tracked since
Jul 6, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…