Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/weak-to-strong
Library/weak-to-strongForked

openai/weak-to-strong

weak-to-strong

**STATUS**: This codebase is not well tested and does not use the exact same settings we used in the paper, but in our experience gives qualitatively

View on GitHub↗Upstream openai/weak-to-strong↗

Builder

OpenAI

OpenAI

openai • ai-lab

Stars

2,553

Using upstream star count

Forks

312

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Dec 13, 2023

Project creation date

README Summary

**STATUS**: This codebase is not well tested and does not use the exact same settings we used in the paper, but in our experience gives qualitatively similar results when using large model size gaps and multiple seeds. Expected results can be found for two datasets below.

Community Evaluation

Loading…

AI Dev Skills

Unmapped

AI GovernanceAI Safety ResearchMachine Learning Research MethodologyModel AlignmentNeural Network TrainingSupervised LearningWeak-to-Strong Generalization

Tags

AI GovernanceAI Safety ResearchMachine Learning Research MethodologyModel AlignmentNeural Network TrainingSupervised LearningWeak-to-Strong GeneralizationAI SafetyAnthropic / ClaudeCLI ToolData ScienceForkedHuggingFaceJupyterLarge Language ModelsOpenAIPythonTransformersTutorial

Taxonomy

AI Trends

AI SafetyAI AlignmentScalable OversightWeak-to-Strong Generalization

category

Foundation ModelsDev Tools & AutomationLearning ResourcesSecurity & SafetyData Science & Analytics

Deployment Context

Self-hostedResearch Environment

Modalities

Text

Skill Areas

AI Safety ResearchModel AlignmentWeak-to-Strong GeneralizationSupervised LearningNeural Network TrainingAI GovernanceMachine Learning Research Methodology

tag

AI SafetyAnthropic / ClaudeCLI ToolData ScienceForkedHuggingFaceJupyterLarge Language ModelsOpenAIPythonTransformersTutorial

Use Cases

AI Model Alignment ResearchWeak Supervision ExperimentsAI Safety EvaluationModel Generalization Studies

Recent Activity

Updated 2 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
medium
Maturity
research

Categories

Dev Tools & AutomationPrimaryLearning ResourcesSecurity & SafetyData Science & AnalyticsFoundation ModelsSafety & AlignmentOther AI / ML

PM Skills

Data & EvaluationDeveloper PlatformSafety & Alignment

Languages

Python100.0%

Timeline

Project created
Dec 13, 2023
Forked
Mar 14, 2026
Your last push
2 years ago
Upstream last push
2 years ago
Tracked since
May 19, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…