Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/snorkel
Library/snorkelForked

snorkel-team/snorkel

snorkel

A system for quickly generating training data with weak supervision

View on GitHub↗Upstream snorkel-team/snorkel↗

Builder

snorkel-team

snorkel-team

snorkel-team • individual

Stars

5,970

Using upstream star count

Forks

855

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Feb 26, 2016

Project creation date

README Summary

***Programmatically Build and Manage Training Data***

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Active LearningData AugmentationData-Centric AILabel Function DesignMachine Learning Pipeline DesignMulti-Source LearningProbabilistic ModelsProgrammatic LabelingTraining Data GenerationWeak Supervision

Tags

Active LearningData AugmentationData-Centric AILabel Function DesignMachine Learning Pipeline DesignMulti-Source LearningProbabilistic ModelsProgrammatic LabelingTraining Data GenerationWeak SupervisionDockerForkedMachine LearningPyTorchPythonRoadmapTutorial

Taxonomy

AI Trends

Data-Centric AICompound AI SystemsHuman-in-the-Loop MLAutomated ML

category

Learning ResourcesModel TrainingMLOps & Infrastructure

Deployment Context

Self-hostedCloudOn-premise

Industries

HealthcareFinTechLegal TechE-commerceMedia & EntertainmentManufacturingGovernment

Modalities

TextImageTabular

Skill Areas

Weak SupervisionTraining Data GenerationProgrammatic LabelingData-Centric AILabel Function DesignMulti-Source LearningProbabilistic ModelsActive LearningData AugmentationMachine Learning Pipeline Design

tag

DockerForkedMachine LearningPyTorchPythonRoadmapTutorial

Use Cases

Text ClassificationNamed Entity RecognitionInformation ExtractionSentiment AnalysisDocument LabelingImage ClassificationMedical Record AnalysisFinancial Document ProcessingContent Moderation

Recent Activity

Updated 2 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

Model TrainingPrimaryOther AI / MLLearning ResourcesMLOps & Infrastructure

PM Skills

Scale & Reliability

Languages

Python100.0%

Timeline

Project created
Feb 26, 2016
Forked
Mar 22, 2026
Your last push
2 years ago
Upstream last push
1 months ago
Tracked since
May 2, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…