Library/stable-baselines3
Library/stable-baselines3Forked

DLR-RM/stable-baselines3

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Builder

DLR-RM

DLR-RM

DLR-RM • individual

Stars

13,020

Using upstream star count

Forks

2,094

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 5, 2020

Project creation date

README Summary

Stable Baselines3 is a PyTorch-based library providing reliable, well-tested implementations of reinforcement learning algorithms. It serves as the successor to the original Stable Baselines library, offering improved performance and PyTorch integration. The library focuses on providing stable, documented RL algorithms that are easy to use and extend.

AI Dev Skills

Unmapped

Reinforcement LearningPolicy Gradient MethodsActor-Critic MethodsDeep Q-NetworksProximal Policy OptimizationSoft Actor-CriticTwin Delayed Deep Deterministic Policy GradientAdvantage Actor-CriticDeep Deterministic Policy GradientContinuous ControlDiscrete ControlNeural Network TrainingGymnasium Environment Integration

Tags

Reinforcement LearningPolicy Gradient MethodsActor-Critic MethodsDeep Q-NetworksProximal Policy OptimizationSoft Actor-CriticTwin Delayed Deep Deterministic Policy GradientAdvantage Actor-CriticDeep Deterministic Policy GradientContinuous ControlDiscrete ControlNeural Network TrainingGymnasium Environment IntegrationEdge DeploymentEmbodied AIFinanceNumerical State SpacesContinuous Action SpacesAlgorithmic TradingDiscrete Action SpacesResource Allocation OptimizationIndustrial AutomationMulti-agent System TrainingAI SafetyImage ObservationsAutonomous Decision MakingRobot Control and NavigationSelf-hostedCloud TrainingGame AI DevelopmentAgentic AIRoboticsGamingControl System DesignAutonomous VehiclesSimulation EnvironmentsPython

Taxonomy

Recent Activity

Updated 26 days ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

RoboticsPrimaryDev Tools & AutomationInference & ServingSafety & AlignmentFinance & LegalEdge & Mobile AIOther AI / MLModel TrainingAI Agents

PM Skills

Developer Platform

Languages

Python100.0%

Timeline

Project created
May 5, 2020
Forked
Mar 23, 2026
Your last push
26 days ago
Upstream last push
12 days ago
Tracked since
Mar 18, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…