Reporium

facebookresearch/sam-audio

sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Builder

Meta Research

facebookresearch • ai-lab

Stars

3,509

Using upstream star count

Forks

316

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Sep 4, 2025

Project creation date

README Summary

arXiv: https://arxiv.org/abs/2512.18099
CI status badge: https://github.com/facebookresearch/sam-audio/actions/workflows/ci.yaml/badge.svg
Hugging Face collection: https://huggingface.co/collections/facebook/sam-audio

AI Dev Skills

Unmapped

Audio Signal Processing, Computer Vision Segmentation, Deep Learning Model Inference, Foundation Model Adaptation, Multimodal AI, Transformer Architecture

Tags

Audio Signal Processing, Computer Vision Segmentation, Deep Learning Model Inference, Foundation Model Adaptation, Multimodal AI, Transformer Architecture, Backend, Embeddings, Evals, Forked, GPU / CUDA, HuggingFace, Music Tech, PyTorch, Python, Reranking, Research / Papers

Taxonomy

AI Trends

Foundation Models, Multimodal AI, Segment Anything Model Extensions

Category

Foundation Models, RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Dev Tools & Automation, Learning Resources, Industry: Audio & Music

Deployment Context

Self-hosted, Research Environment

Industries

Audio Technology, Media Processing, Entertainment, Research

Modalities

Audio, Multimodal

Skill Areas

Audio Signal Processing, Computer Vision Segmentation, Deep Learning Model Inference, Transformer Architecture, Multimodal AI, Foundation Model Adaptation

Tag

Backend, Embeddings, Evals, Forked, GPU / CUDA, HuggingFace, Multimodal AI, Music Tech, PyTorch, Python, Reranking, Research / Papers

Use Cases

Audio Segmentation, Sound Source Separation, Audio Content Analysis, Audio-Visual Synchronization

Recent Activity

Updated 2 months ago

7 Days: 0
30 Days: 0
90 Days: 0

Quality

Quality: medium
Maturity: research

Categories

RAG & Retrieval (Primary), Evals & Benchmarking, Inference & Serving, Dev Tools & Automation, Learning Resources, Industry: Audio & Music, Foundation Models, Model Training, Generative Media, Multimodal AI, Search & Knowledge, Other AI / ML

PM Skills

User Experience, Data & Evaluation, Product Discovery

Languages

Python 100.0%

Timeline

Project created: Sep 4, 2025
Forked: Dec 27, 2025
Your last push: 2 months ago
Upstream last push: 4 months ago
Tracked since: Mar 17, 2026

Similar Repos

pgvector cosine similarity · $0
