Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/LLaMA-Factory
Library/LLaMA-FactoryForked

hiyouga/LlamaFactory

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

View on GitHub↗Upstream hiyouga/LlamaFactory↗

Builder

hiyouga

hiyouga

hiyouga • individual

Stars

71,538

Using upstream star count

Forks

8,726

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 28, 2023

Project creation date

README Summary

[![GitHub Repo stars](https://img.shields.io/github/stars/hiyouga/LLaMA-Factory?style=social)](https://github.com/hiyouga/LLaMA-Factory/stargazers) [![GitHub last commit](https://img.shields.io/github/last-commit/hiyouga/LLaMA-Factory)](https://github.com/hiyouga/LLaMA-Factory/commits/main) [![GitHub contributors](https://img.shields.io/github/contributors/hiyouga/LLaMA-Factory?color=orange)](https://github.com/hiyouga/LLaMA-Factory/graphs/contributors) [![GitHub workflow](https://github.com/hiy

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Custom Dataset IntegrationDirect Preference Optimization (DPO)Distributed TrainingGradient CheckpointingLoRA Fine-tuningMixed Precision TrainingModel Evaluation and BenchmarkingModel Merging TechniquesModel QuantizationMulti-GPU TrainingParameter-Efficient Fine-Tuning (PEFT)Proximal Policy Optimization (PPO)QLoRA QuantizationReinforcement Learning from Human Feedback (RLHF)Supervised Fine-Tuning (SFT)Vision-Language Model Fine-tuning

Tags

Custom Dataset IntegrationDirect Preference Optimization (DPO)Distributed TrainingGradient CheckpointingLoRA Fine-tuningMixed Precision TrainingModel Evaluation and BenchmarkingModel Merging TechniquesModel QuantizationMulti-GPU TrainingParameter-Efficient Fine-Tuning (PEFT)Proximal Policy Optimization (PPO)QLoRA QuantizationReinforcement Learning from Human Feedback (RLHF)Supervised Fine-Tuning (SFT)Vision-Language Model Fine-tuningAI AgentsAI SafetyAWSAnthropic / ClaudeBenchmarkingCourseDPODatabaseDeepSeekDeepSpeedDistillationDockerEmbeddingsEvalsFSDPFinTechFine-TuningForkedGPTGPU / CUDAGRPOGemmaHealthcare AIHuggingFaceImage GenerationLLM ServingLarge Language ModelsLlamaLoRA / PEFTMLflowMMLUMergeKitMistralMulti-AgentMultimodal AIMusic TechOllamaOpenAIPhiPlanning / CoTPyTorchPythonQuantizationQwenRLHFReal-Time / StreamingReasoning ModelsReinforcement LearningResearch / PapersRoboticsSGLangSageMakerSecurityStable DiffusionSynthetic DataTRLTool UseTransformersTutorialUnslothVisualizationWeights & BiasesvLLM

Taxonomy

AI Trends

Parameter-Efficient TrainingModel AlignmentMultimodal AIOpen Source LLMsDemocratized AI TrainingEfficient Model CustomizationHuman Preference Learning

category

Model TrainingFoundation ModelsAI AgentsRAG & RetrievalEvals & BenchmarkingObservability & MonitoringInference & ServingGenerative MediaRoboticsMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning ResourcesIndustry: HealthcareIndustry: FinTechIndustry: Audio & MusicSecurity & SafetyData Science & Analytics

Deployment Context

Self-hostedCloud GPUMulti-GPU ClustersOn-premise

Modalities

TextImageMultimodal

Skill Areas

Parameter-Efficient Fine-Tuning (PEFT)LoRA Fine-tuningQLoRA QuantizationSupervised Fine-Tuning (SFT)Reinforcement Learning from Human Feedback (RLHF)Direct Preference Optimization (DPO)Proximal Policy Optimization (PPO)Model QuantizationDistributed TrainingVision-Language Model Fine-tuningMulti-GPU TrainingGradient CheckpointingMixed Precision TrainingModel Merging TechniquesCustom Dataset IntegrationModel Evaluation and Benchmarking

tag

AI AgentsAI SafetyAWSAnthropic / ClaudeBenchmarkingCourseDPODatabaseDeepSeekDeepSpeedDistillationDockerEmbeddingsEvalsFSDPFinTechFine-TuningForkedGPTGPU / CUDAGRPOGemmaHealthcare AIHuggingFaceImage GenerationLLM ServingLarge Language ModelsLlamaLoRA / PEFTMLflowMMLUMergeKitMistralMulti-AgentMultimodal AIMusic TechOllamaOpenAIPhiPlanning / CoTPyTorchPythonQuantizationQwenRLHFReal-Time / StreamingReasoning ModelsReinforcement LearningResearch / PapersRoboticsSGLangSageMakerSecurityStable DiffusionSynthetic DataTRLTool UseTransformersTutorialUnslothVisualizationWeights & BiasesvLLM

Use Cases

Custom Chatbot DevelopmentDomain-Specific Language Model AdaptationInstruction Following Model TrainingMulti-turn Conversation TrainingVision-Language Task Fine-tuningModel Alignment and Safety TrainingFew-shot Learning EnhancementCross-lingual Model AdaptationCode Generation Model Fine-tuningMathematical Reasoning Enhancement

Recent Activity

Updated 6 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

RAG & RetrievalPrimaryEvals & BenchmarkingObservability & MonitoringInference & ServingMLOps & InfrastructureDev Tools & AutomationCloud & PlatformsLearning ResourcesIndustry: HealthcareIndustry: FinTechIndustry: Audio & MusicSecurity & SafetyData Science & AnalyticsFoundation ModelsAI AgentsModel TrainingGenerative MediaRoboticsSafety & AlignmentHealthcare & BiologyFinance & LegalMultimodal AISearch & KnowledgeOther AI / ML

PM Skills

Cost & EfficiencySafety & AlignmentUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper PlatformAI-Native Architecture

Languages

Python100.0%

Timeline

Project created
May 28, 2023
Forked
Nov 8, 2025
Your last push
6 months ago
Upstream last push
20 days ago
Tracked since
Nov 6, 2025

Similar Repos

pgvector cosine similarity · $0

Loading…