Reporium

Stability-AI/generative-models

generative-models

Generative Models by Stability AI

Builder

Stability-AI • individual

Stars

27,179

Using upstream star count

Forks

3,089

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jun 22, 2023

Project creation date

README Summary

**May 20, 2025** - We are releasing **[Stable Video 4D 2.0 (SV4D 2.0)](https://huggingface.co/stabilityai/sv4d2.0)**, an enhanced video-to-4D diffusion model for high-fidelity novel-view video synthesis and 4D asset generation. For research purposes:

- **SV4D 2.0** was trained to generate 48 frames (12 video frames x 4 camera views) at 576x576 resolution, given a 12-frame input video of the same size, ideally consisting of white-background images of a moving object.
- Compared to our pre
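
As a rough illustration of the frame arithmetic in the summary above, the sketch below lays out 12 video frames across 4 camera views and counts the resulting 48 generated images at 576x576. The names (`NUM_FRAMES`, `NUM_VIEWS`, `output_frame_grid`) and the (frame, view) ordering are assumptions made for illustration; they are not taken from the SV4D 2.0 code.

```python
from itertools import product

NUM_FRAMES = 12          # video frames in the input clip (from the README summary)
NUM_VIEWS = 4            # camera views generated per frame (from the README summary)
RESOLUTION = (576, 576)  # height and width of each image (from the README summary)


def output_frame_grid(num_frames=NUM_FRAMES, num_views=NUM_VIEWS):
    """Enumerate hypothetical (frame_index, view_index) pairs, one per generated image."""
    return list(product(range(num_frames), range(num_views)))


grid = output_frame_grid()
# Total pixel count across all generated images, ignoring color channels.
total_pixels = len(grid) * RESOLUTION[0] * RESOLUTION[1]
print(len(grid), total_pixels)
```

The grid only counts outputs; it says nothing about how the model batches or orders frames internally.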

AI Dev Skills

Unmapped

CLIP Text Encoding, Computer Vision, Deep Learning Model Training, Diffusion Models, Latent Diffusion Models, Noise Scheduling, Sampling Methods, Stable Diffusion Architecture, Text-to-Image Generation, U-Net Architecture, Variational Autoencoders, Video Generation

Tags

CLIP Text Encoding, Computer Vision, Deep Learning Model Training, Diffusion Models, Latent Diffusion Models, Noise Scheduling, Sampling Methods, Stable Diffusion Architecture, Text-to-Image Generation, U-Net Architecture, Variational Autoencoders, Video Generation, Deep Learning, Distillation, Forked, GPU / CUDA, HuggingFace, Image Generation, NumPy, OpenAI, PyTorch, Python, Research / Papers, Stable Diffusion, Transformers, Watermarking

Taxonomy

AI Trends

Generative AI, Diffusion Models, Foundation Models, Open Source AI, Multimodal AI

category

Foundation Models, Model Training, Inference & Serving, Generative Media, Computer Vision, Learning Resources, Security & Safety, Data Science & Analytics

Deployment Context

Self-hosted, Cloud API, On-premise

Industries

Media & Entertainment, Creative Industries, Marketing & Advertising, Gaming, Film & Video Production

Modalities

Text, Image, Video, Multimodal

Skill Areas

Diffusion Models, Video Generation, Text-to-Image Generation, Computer Vision, Deep Learning Model Training, Stable Diffusion Architecture, Latent Diffusion Models, Variational Autoencoders, CLIP Text Encoding, U-Net Architecture, Noise Scheduling, Sampling Methods

tag

Computer Vision, Deep Learning, Distillation, Forked, GPU / CUDA, HuggingFace, Image Generation, NumPy, OpenAI, PyTorch, Python, Research / Papers, Stable Diffusion, Transformers, Video Generation, Watermarking

Use Cases

Text-to-Image Generation, Text-to-Video Generation, Image-to-Video Generation, Creative Content Generation, Synthetic Media Production, Art Generation, Video Synthesis

Recent Activity

Updated 5 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: high
Maturity: production

Categories

Foundation Models (Primary), Model Training, Inference & Serving, Generative Media, Computer Vision, Coding & Dev Tools, Data Science & Analytics, Search & Knowledge, Other AI / ML, Learning Resources, Security & Safety

PM Skills

Product Discovery

Languages

Python 100.0%

Timeline

Project created: Jun 22, 2023
Forked: Mar 22, 2026
Your last push: 5 months ago
Upstream last push: 5 months ago
Tracked since: Dec 16, 2025

Similar Repos

pgvector cosine similarity · $0
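
The label above suggests that similar repos are ranked by cosine similarity over embeddings stored in pgvector. A minimal sketch of that kind of ranking in plain Python follows. The embedding vectors and repo names are made up for illustration, and `cosine_similarity` and `rank_similar` are hypothetical helpers, not Reporium's implementation. In pgvector itself, the `<=>` operator returns cosine distance, which is 1 minus this similarity.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_similar(query, candidates):
    """Return (name, similarity) pairs sorted from most to least similar."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 3-dimensional embeddings; real embeddings would have many more dimensions.
query = [1.0, 0.0, 1.0]
candidates = {
    "repo-a": [1.0, 0.0, 1.0],
    "repo-b": [0.0, 1.0, 0.0],
    "repo-c": [1.0, 1.0, 0.0],
}
ranking = rank_similar(query, candidates)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Cosine similarity ignores vector length and compares direction only, which is why it is a common choice for comparing text embeddings.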
