Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/DataDreamer
Library/DataDreamerForked

datadreamer-dev/DataDreamer

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

View on GitHub↗Upstream datadreamer-dev/DataDreamer↗

Builder

datadreamer-dev

datadreamer-dev

datadreamer-dev • individual

Stars

1,111

Using upstream star count

Forks

59

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jun 2, 2023

Project creation date

README Summary

<p align="center"> <a href="https://datadreamer.dev"><img src="https://datadreamer.dev/docs/latest/_static/logo.svg" alt="DataDreamer" style="max-width: 100%;"></a><br /> <a href="https://datadreamer.dev"><b>https://datadreamer.dev</b></a> </p> <p align="center"> <b>Prompt. Generate Synthetic Data. Train & Align Models.</b><br /><br /> <a href="https://github.com/datadreamer-dev/DataDreamer/actions/workflows/release.yml"><img src="https://img.shields.io/github/actions/workflow/status/da

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Data AugmentationLarge Language Model TrainingMachine Learning Pipeline DevelopmentModel AlignmentModel Fine-tuningPrompt EngineeringSynthetic Data Generation

Tags

Data AugmentationLarge Language Model TrainingMachine Learning Pipeline DevelopmentModel AlignmentModel Fine-tuningPrompt EngineeringSynthetic Data GenerationCachingForkedHuggingFaceLarge Language ModelsLiteLLMLoRA / PEFTOpen SourcePythonQuantizationResearch / PapersSynthetic Data

Taxonomy

AI Trends

Synthetic DataModel AlignmentPrompt EngineeringCustom Model Training

category

Foundation ModelsModel TrainingInference & ServingLearning Resources

Deployment Context

Self-hostedCloud API

Modalities

Text

Skill Areas

Synthetic Data GenerationModel Fine-tuningPrompt EngineeringLarge Language Model TrainingModel AlignmentData AugmentationMachine Learning Pipeline Development

tag

CachingForkedHuggingFaceLarge Language ModelsLiteLLMLoRA / PEFTOpen SourcePythonQuantizationResearch / PapersSynthetic Data

Use Cases

Synthetic Dataset CreationModel Fine-tuning WorkflowsAI Training Data GenerationPrompt-based Data AugmentationCustom Model Training Pipelines

Recent Activity

Updated 1 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

prototype
Quality
medium
Maturity
prototype

Categories

Inference & ServingPrimaryLearning ResourcesFoundation ModelsModel TrainingSearch & Knowledge

PM Skills

Cost & EfficiencyData & Evaluation

Languages

Python100.0%

Timeline

Project created
Jun 2, 2023
Forked
Mar 22, 2026
Your last push
1 years ago
Upstream last push
1 years ago
Tracked since
Feb 2, 2025

Similar Repos

pgvector cosine similarity · $0

Loading…