Reporium

DLR-RM/stable-baselines3

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.


Builder

DLR-RM • individual

Stars

13,345

Using upstream star count

Forks

2,141

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 5, 2020

Project creation date



AI Dev Skills

Unmapped

Actor-Critic Methods, Advantage Actor-Critic, Continuous Control, Deep Deterministic Policy Gradient, Deep Q-Networks, Discrete Control, Gymnasium Environment Integration, Neural Network Training, Policy Gradient Methods, Proximal Policy Optimization, Reinforcement Learning, Soft Actor-Critic, Twin Delayed Deep Deterministic Policy Gradient

Tags

Actor-Critic Methods, Advantage Actor-Critic, Continuous Control, Deep Deterministic Policy Gradient, Deep Q-Networks, Discrete Control, Gymnasium Environment Integration, Neural Network Training, Policy Gradient Methods, Proximal Policy Optimization, Reinforcement Learning, Soft Actor-Critic, Twin Delayed Deep Deterministic Policy Gradient, Benchmarking, Computer Vision, Curated List, Docker, Evals, Forked, HuggingFace, Jupyter, Machine Learning, PyTorch, Python, Roadmap, Robotics, Scikit-learn, Tutorial, Weights & Biases

Taxonomy

AI Trends

Agentic AI, Embodied AI, AI Safety

Category

Learning Resources, Foundation Models, Model Training, Evals & Benchmarking, Observability & Monitoring, Computer Vision, Robotics, MLOps & Infrastructure, Data Science & Analytics

Deployment Context

Self-hosted, Cloud Training, Edge Deployment, Simulation Environments

Industries

Robotics, Gaming, Finance, Autonomous Vehicles, Industrial Automation

Modalities

Numerical State Spaces, Image Observations, Continuous Action Spaces, Discrete Action Spaces

Skill Areas

Reinforcement Learning, Policy Gradient Methods, Actor-Critic Methods, Deep Q-Networks, Proximal Policy Optimization, Soft Actor-Critic, Twin Delayed Deep Deterministic Policy Gradient, Advantage Actor-Critic, Deep Deterministic Policy Gradient, Continuous Control, Discrete Control, Neural Network Training, Gymnasium Environment Integration

Tag

Benchmarking, Computer Vision, Curated List, Docker, Evals, Forked, HuggingFace, Jupyter, Machine Learning, PyTorch, Python, Reinforcement Learning, Roadmap, Robotics, Scikit-learn, Tutorial, Weights & Biases

Use Cases

Robot Control and Navigation, Game AI Development, Algorithmic Trading, Resource Allocation Optimization, Autonomous Decision Making, Control System Design, Multi-agent System Training

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

2

Update contribution guidelines regarding LLM/code assistant usage (#2231)

Antonin RAFFIN • Mar 18, 2026

a72be40

Update changelog for SB3 ecosystem (#2227)

Antonin RAFFIN • Mar 13, 2026

5bb5da5

Switch to Markdown documentation (MyST parser) (#2219)

Antonin RAFFIN • Feb 21, 2026

cc20f5a

Quality

Quality
high
Maturity
production

Categories

Learning Resources (Primary), Evals & Benchmarking, Observability & Monitoring, MLOps & Infrastructure, Data Science & Analytics, Foundation Models, Model Training, Computer Vision, Robotics, Safety & Alignment, Other AI / ML

PM Skills

Scale & Reliability, Data & Evaluation

Languages

Python 100.0%

Timeline

Project created
May 5, 2020
Forked
Mar 23, 2026
Your last push
2 months ago
Upstream last push
23 days ago
Tracked since
Mar 18, 2026
