Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/prm800k
Library/prm800kForked

openai/prm800k

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

View on GitHub↗Upstream openai/prm800k↗

Builder

OpenAI

OpenAI

openai • ai-lab

Stars

2,136

Using upstream star count

Forks

126

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 13, 2023

Project creation date

README Summary

[[Blog Post]](https://openai.com/research/improving-mathematical-reasoning-with-process-supervision) [[Paper]](https://arxiv.org/abs/2305.20050)

Community Evaluation

Loading…

AI Dev Skills

Unmapped

AI Safety and AlignmentDataset Curation and LabelingLarge Language Model TrainingMathematical Reasoning EvaluationProcess Reward ModelingReinforcement Learning from Human FeedbackStep-level Supervision

Tags

AI Safety and AlignmentDataset Curation and LabelingLarge Language Model TrainingMathematical Reasoning EvaluationProcess Reward ModelingReinforcement Learning from Human FeedbackStep-level SupervisionEvalsForkedJavaScriptOpenAIPythonResearch / Papers

Taxonomy

AI Trends

AI SafetyProcess SupervisionMathematical ReasoningHuman Feedback Learning

category

Foundation ModelsEvals & BenchmarkingLearning Resources

Deployment Context

Research EnvironmentModel Training Infrastructure

Industries

Education

Modalities

Text

Skill Areas

Process Reward ModelingMathematical Reasoning EvaluationStep-level SupervisionReinforcement Learning from Human FeedbackLarge Language Model TrainingAI Safety and AlignmentDataset Curation and Labeling

tag

EvalsForkedJavaScriptOpenAIPythonResearch / Papers

Use Cases

Mathematical Problem Solving EvaluationAI Reasoning Step VerificationProcess Reward Model TrainingMathematical Tutoring System Development

Recent Activity

Updated 3 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
high
Maturity
research

Categories

Evals & BenchmarkingPrimaryLearning ResourcesFoundation ModelsSearch & KnowledgeOther AI / ML

PM Skills

Data & Evaluation

Languages

Python100.0%

Timeline

Project created
Apr 13, 2023
Forked
Mar 14, 2026
Your last push
3 years ago
Upstream last push
3 years ago
Tracked since
Jun 1, 2023

Similar Repos

pgvector cosine similarity · $0

Loading…