
snorkel-team/snorkel

snorkel

A system for quickly generating training data with weak supervision

Builder

snorkel-team

snorkel-team • individual

Stars

5,946

Using upstream star count

Forks

855

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Feb 26, 2016

Project creation date

README Summary

Snorkel is a system for programmatically building and managing training datasets without manual labeling. It enables users to rapidly create large training sets by writing labeling functions that express weak supervision sources, which are then combined using data programming techniques to produce probabilistic training labels.
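The labeling-function idea in the summary can be illustrated with a small, library-free sketch. Everything below is hypothetical: the function names, the spam/ham task, and the label constants are invented for illustration, and the combination step is a plain unweighted vote rather than Snorkel's learned label model, which estimates the accuracy of each source. It only shows the general shape: several noisy heuristics vote or abstain, and the votes are merged into a probabilistic label.

```python
# Minimal sketch of weak supervision with labeling functions (LFs).
# Assumption: all names are hypothetical and the snorkel package is not used.

ABSTAIN, HAM, SPAM = -1, 0, 1


def lf_contains_link(text):
    """Vote SPAM when the text contains a URL-like token; otherwise abstain."""
    return SPAM if "http" in text.lower() else ABSTAIN


def lf_mentions_prize(text):
    """Vote SPAM when the text mentions a prize; otherwise abstain."""
    return SPAM if "prize" in text.lower() else ABSTAIN


def lf_short_message(text):
    """Vote HAM for messages of four words or fewer; otherwise abstain."""
    return HAM if len(text.split()) <= 4 else ABSTAIN


LABELING_FUNCTIONS = [lf_contains_link, lf_mentions_prize, lf_short_message]


def probabilistic_label(text, lfs=LABELING_FUNCTIONS):
    """Return an estimate of P(SPAM) as the share of non-abstaining votes
    that say SPAM, or None when every labeling function abstains.

    This is a simple unweighted vote, a stand-in for a learned label model.
    """
    votes = [lf(text) for lf in lfs]
    votes = [v for v in votes if v != ABSTAIN]
    if not votes:
        return None
    return sum(1 for v in votes if v == SPAM) / len(votes)


if __name__ == "__main__":
    for message in [
        "Claim your prize at http://example.com now please",
        "You won a prize",
        "see you soon",
    ]:
        print(message, "->", probabilistic_label(message))
```

An unweighted vote treats every heuristic as equally reliable. A label model of the kind the summary describes would instead weight sources by their estimated accuracy, so this sketch should be read as a baseline only.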

AI Dev Skills

Unmapped

Weak Supervision, Training Data Generation, Programmatic Labeling, Data-Centric AI, Label Function Design, Multi-Source Learning, Probabilistic Models, Active Learning, Data Augmentation, Machine Learning Pipeline Design

Tags

Weak Supervision, Training Data Generation, Programmatic Labeling, Data-Centric AI, Label Function Design, Multi-Source Learning, Probabilistic Models, Active Learning, Data Augmentation, Machine Learning Pipeline Design, E-commerce, Self-hosted, Media & Entertainment, Text, Government, Image, Content Moderation, Legal Tech, Compound AI Systems, On-premise, Named Entity Recognition, Medical Record Analysis, Cloud, Document Labeling, Text Classification, Automated ML, Financial Document Processing, FinTech, Human-in-the-Loop ML, Manufacturing, Information Extraction, Image Classification, Tabular, Healthcare, Sentiment Analysis, Python

Taxonomy

Recent Activity

Updated 1 year ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: high
Maturity: production

Categories

RAG & Retrieval (Primary), Model Training, Computer Vision, Healthcare & Biology, Finance & Legal, Other AI / ML, MLOps & Infrastructure, Dev Tools & Automation, Industry: FinTech, NLP & Text, ML Platform & Infrastructure

PM Skills

Scale & Reliability, Developer Platform

Languages

Python 100.0%

Timeline

Project created: Feb 26, 2016
Forked: Mar 22, 2026
Your last push: 1 year ago
Upstream last push: 1 year ago
Tracked since: May 2, 2024
