Reporium

arcee-ai/mergekit

mergekit

Tools for merging pretrained large language models.

Upstream: arcee-ai/mergekit

Builder

arcee-ai • individual

Stars

7,104

Using upstream star count

Forks

720

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Aug 21, 2023

Project creation date

README Summary

License: LGPL v3 (https://www.gnu.org/licenses/lgpl-3.0). Tests: pre-commit workflow on GitHub Actions (https://github.com/arcee-ai/mergekit/actions/workflows/pre-commit.yml). Community: Arcee Discord (https://discord.gg/arceeai).

AI Dev Skills

Unmapped

Gradient-based Model Fusion, Large Language Model Architecture, Model Compression, Model Ensemble Methods, Model Merging Techniques, Neural Network Weight Interpolation, Parameter Space Analysis, Transformer Model Optimization

Tags

Gradient-based Model Fusion, Large Language Model Architecture, Model Compression, Model Ensemble Methods, Model Merging Techniques, Neural Network Weight Interpolation, Parameter Space Analysis, Transformer Model Optimization, Backend, Benchmarking, Distillation, Embeddings, Forked, GPU / CUDA, HuggingFace, Llama, LoRA / PEFT, MergeKit, Mistral, OpenAI, PyTorch, Python, Speculative Decoding, Transformers

Taxonomy

AI Trends

Small Language Models, Model Efficiency, Open Source AI, Democratized AI Development

Category

Foundation Models, RAG & Retrieval, Model Training, Evals & Benchmarking, Inference & Serving, Dev Tools & Automation

Deployment Context

Self-hosted, On-premise

Modalities

Text

Skill Areas

Model Merging Techniques, Large Language Model Architecture, Neural Network Weight Interpolation, Transformer Model Optimization, Model Compression, Parameter Space Analysis, Gradient-based Model Fusion, Model Ensemble Methods

Tag

Backend, Benchmarking, Distillation, Embeddings, Forked, GPU / CUDA, HuggingFace, Llama, LoRA / PEFT, MergeKit, Mistral, OpenAI, PyTorch, Python, Speculative Decoding, Transformers

Use Cases

Creating Domain-Specialized Language Models, Model Capability Enhancement, Knowledge Transfer Between Models, Model Performance Optimization, Custom Model Development, Research Model Experimentation

Recent Activity

Updated 2 months ago

7 Days: 0
30 Days: 0
90 Days: 1

Add Qwen3-VL dense models (#670)

pengdurice • Mar 15, 2026

7111360

Add Qwen3-VL support (#665)

pengdurice • Feb 28, 2026

85aba9b

Quality

Quality: high
Maturity: beta

Categories

RAG & Retrieval (Primary), Model Training, Evals & Benchmarking, Inference & Serving, Other AI / ML, Dev Tools & Automation, Foundation Models

PM Skills

Cost & Efficiency, Data & Evaluation, Product Discovery

Languages

Python 100.0%

Timeline

Project created: Aug 21, 2023
Forked: Mar 13, 2026
Your last push: 2 months ago
Upstream last push: 28 days ago
Tracked since: Mar 17, 2026
