Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/dolly
Library/dollyForked

databrickslabs/dolly

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

View on GitHub↗Upstream databrickslabs/dolly↗

Builder

databrickslabs

databrickslabs

databrickslabs • individual

Stars

10,790

Using upstream star count

Forks

1,139

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Mar 24, 2023

Project creation date

README Summary

Databricks’ [Dolly](https://huggingface.co/databricks/dolly-v2-12b) is an instruction-following large language model trained on the Databricks machine learning platform that is licensed for commercial use. Based on `pythia-12b`, Dolly is trained on ~15k instruction/response fine tuning records [`databricks-dolly-15k`](https://huggingface.co/datasets/databricks/databricks-dolly-15k) generated by Databricks employees in capability domains from the InstructGPT paper, including brainstorming, classi

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Dataset CurationDeep Learning Model DevelopmentHuman Feedback LearningInstruction FollowingLarge Language Model TrainingModel EvaluationModel Fine-tuningNatural Language GenerationTransformer Architecture

Tags

Dataset CurationDeep Learning Model DevelopmentHuman Feedback LearningInstruction FollowingLarge Language Model TrainingModel EvaluationModel Fine-tuningNatural Language GenerationTransformer ArchitectureBenchmarkingDeepSpeedFine-TuningForkedHuggingFaceLarge Language ModelsMachine LearningOpenAIPyTorchPythonSparkTransformersTutorial

Taxonomy

AI Trends

Open Source AIInstruction Following ModelsHuman-Curated Training DataTransparent AI Development

category

Foundation ModelsModel TrainingEvals & BenchmarkingLearning Resources

Deployment Context

Self-hostedCloud APIOn-premise

Modalities

Text

Skill Areas

Large Language Model TrainingInstruction FollowingHuman Feedback LearningModel Fine-tuningNatural Language GenerationTransformer ArchitectureDeep Learning Model DevelopmentDataset CurationModel Evaluation

tag

BenchmarkingDeepSpeedFine-TuningForkedHuggingFaceLarge Language ModelsMachine LearningOpenAIPyTorchPythonSparkTransformersTutorial

Use Cases

Instruction FollowingQuestion AnsweringText GenerationConversational AIResearch Benchmarking

Recent Activity

Updated 2 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
high
Maturity
research

Categories

Foundation ModelsPrimaryModel TrainingEvals & BenchmarkingOther AI / MLLearning Resources

PM Skills

Data & Evaluation

Languages

Python100.0%

Timeline

Project created
Mar 24, 2023
Forked
Mar 22, 2026
Your last push
2 years ago
Upstream last push
2 years ago
Tracked since
Jun 30, 2023

Similar Repos

pgvector cosine similarity · $0

Loading…