Library/AudioGPT
Library/AudioGPTForked

AIGC-Audio/AudioGPT

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Builder

AIGC-Audio

AIGC-Audio

AIGC-Audio • individual

Stars

10,205

Using upstream star count

Forks

861

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Mar 16, 2023

Project creation date

README Summary

AudioGPT is a multimodal AI system that combines large language models with various audio foundation models to understand and generate different types of audio content including speech, music, sound effects, and talking head videos. The system uses a modular architecture where different specialized models handle specific audio tasks, coordinated through natural language instructions. It provides a unified interface for complex audio processing tasks through conversational AI interactions.

AI Dev Skills

Unmapped

Multimodal AIAudio GenerationSpeech SynthesisMusic GenerationSound Effect GenerationTalking Head Video GenerationFoundation ModelsCross-Modal UnderstandingAudio ProcessingComputer VisionNeural Audio Synthesis

Tags

Multimodal AIAudio GenerationSpeech SynthesisMusic GenerationSound Effect GenerationTalking Head Video GenerationFoundation ModelsCross-Modal UnderstandingAudio ProcessingComputer VisionNeural Audio SynthesisAudio Content GenerationAudio AIMedia ProductionLarge Multimodal ModelsGenerative AIVideoMarketingAccessibility TechnologyGamingMultimodalMultimodal ReasoningSound Effect CreationAudio-Visual Content CreationEducationVoice CloningMusicEntertainmentSpeechMusic CompositionMusic Production AssistanceSelf-hostedCloud APIAudioPython

Taxonomy

Recent Activity

Updated 1 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
medium
Maturity
research

Categories

Coding & Dev ToolsPrimaryMultimodal AIOther AI / MLGenerative MediaComputer VisionRoboticsFoundation Models

PM Skills

Scale & Reliability

Languages

Python100.0%

Timeline

Project created
Mar 16, 2023
Forked
Mar 23, 2026
Your last push
1 years ago
Upstream last push
1 years ago
Tracked since
Jul 6, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…