anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Builder
Anthropic
anthropics • ai-lab
Stars
1,839
Using upstream star count
Forks
159
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
Apr 10, 2022
Project creation date
> [!NOTE]
> This GitHub repo is now deprecated in favor of the Hugging Face hosted repository, which contains the same data: https://huggingface.co/datasets/Anthropic/hh-rlhf
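
Since the note points readers to the Hugging Face copy, a minimal sketch of reading the data from there may help. It assumes the `datasets` library is installed, that each record has `chosen` and `rejected` string fields, and that turns are marked with `\n\nHuman:` and `\n\nAssistant:`. None of this is stated on this page, so check the dataset card before relying on it. The helper names `split_turns` and `load_preference_pairs` and the sample record are hypothetical.

```python
import re

# Assumption: transcripts mark turns with "\n\nHuman:" and "\n\nAssistant:".
TURN_MARKER = re.compile(r"\n\n(Human|Assistant): ?")


def split_turns(transcript: str) -> list[tuple[str, str]]:
    """Split one transcript string into (speaker, text) pairs."""
    parts = TURN_MARKER.split(transcript)
    # parts[0] is whatever precedes the first marker (normally empty);
    # after that, speaker names and turn texts alternate.
    speakers, texts = parts[1::2], parts[2::2]
    return [(speaker, text.strip()) for speaker, text in zip(speakers, texts)]


def load_preference_pairs(split: str = "train"):
    """Load the dataset from the Hugging Face Hub.

    Needs network access and `pip install datasets`; it is not called below.
    """
    from datasets import load_dataset  # lazy import so the parser works offline

    return load_dataset("Anthropic/hh-rlhf", split=split)


# Hypothetical record, made up for illustration; not taken from the dataset.
example = {
    "chosen": "\n\nHuman: What is 2 + 2?\n\nAssistant: 4.",
    "rejected": "\n\nHuman: What is 2 + 2?\n\nAssistant: I am not sure.",
}
chosen_turns = split_turns(example["chosen"])
rejected_turns = split_turns(example["rejected"])
```

To work with the real data, call `load_preference_pairs()` and pass each record's `chosen` or `rejected` field to `split_turns`.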
Updated 11 months ago
7 Days
0
30 Days
0
90 Days
0
No language breakdown recorded.