Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/ROLL
Library/ROLLForked

alibaba/ROLL

ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

View on GitHub↗Upstream alibaba/ROLL↗

Builder

alibaba

alibaba

alibaba • individual

Stars

3,184

Using upstream star count

Forks

287

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 28, 2025

Project creation date

README Summary

<img src="assets/roll.jpeg" width="40%" alt="ROLL Logo">

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Deep Reinforcement LearningDistributed ComputingGPU OptimizationLanguage Model Fine-tuningLarge Language Model TrainingModel ParallelismParallel ProcessingReinforcement LearningScalable ML Systems

Tags

Deep Reinforcement LearningDistributed ComputingGPU OptimizationLanguage Model Fine-tuningLarge Language Model TrainingModel ParallelismParallel ProcessingReinforcement LearningScalable ML SystemsAI AgentsAI SafetyCourseDPODeepSpeedDockerFSDPForkedGRPOHuggingFaceImage GenerationInferenceLLM ServingLarge Language ModelsLoRA / PEFTPlanning / CoTQwenReal-Time / StreamingResearch / PapersSGLangTool UseVideo GenerationWeights & BiasesvLLM

Taxonomy

AI Trends

Large Language ModelsReinforcement Learning from Human FeedbackScalable AI TrainingDistributed AI Systems

category

Model TrainingFoundation ModelsAI AgentsObservability & MonitoringInference & ServingGenerative MediaMLOps & InfrastructureLearning ResourcesSecurity & Safety

Deployment Context

CloudMulti-GPU ClustersDistributed ComputingHigh-performance Computing

Industries

AI ResearchMachine Learning InfrastructureCloud Computing

Modalities

Text

Skill Areas

Reinforcement LearningLarge Language Model TrainingDistributed ComputingModel ParallelismGPU OptimizationDeep Reinforcement LearningLanguage Model Fine-tuningScalable ML SystemsParallel Processing

tag

AI AgentsAI SafetyActiveCourseDPODeepSpeedDockerFSDPForkedGRPOHuggingFaceImage GenerationInferenceLLM ServingLarge Language ModelsLoRA / PEFTPlanning / CoTQwenReal-Time / StreamingReinforcement LearningResearch / PapersSGLangTool UseVideo GenerationWeights & BiasesvLLM

Use Cases

Distributed RL TrainingLarge-scale Language Model Reinforcement LearningMulti-GPU RL WorkloadsScalable AI Agent TrainingHigh-performance ML Model Training

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

20

add reference to notable work

pUmpKin-Co • Mar 26, 2026

f509efc

Update config name in run_onpolicy_distill_pipeline.sh

JoeyChow • Mar 24, 2026

345edea

Add Qwen3.5 ROCK agentic SWE example

shamanez • Mar 24, 2026

4fd4147

Quality

research
Quality
medium
Maturity
research

Categories

Observability & MonitoringPrimaryInference & ServingMLOps & InfrastructureLearning ResourcesSecurity & SafetyFoundation ModelsAI AgentsModel TrainingGenerative MediaSafety & AlignmentCoding & Dev ToolsSearch & KnowledgeOther AI / ML

PM Skills

Cost & EfficiencySafety & AlignmentScale & ReliabilityDeveloper PlatformAI-Native Architecture

Languages

Python100.0%

Timeline

Project created
May 28, 2025
Forked
Mar 29, 2026
Your last push
2 months ago
Upstream last push
16 days ago
Tracked since
Mar 29, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…