Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/DeepSeek-V3
Library/DeepSeek-V3Forked

deepseek-ai/DeepSeek-V3

DeepSeek-V3

<!-- markdownlint-disable first-line-h1 --> <!-- markdownlint-disable html --> <!-- markdownlint-disable no-duplicate-header -->

View on GitHub↗Upstream deepseek-ai/DeepSeek-V3↗

Builder

DeepSeek

DeepSeek

deepseek-ai • ai-lab

Stars

103,645

Using upstream star count

Forks

16,739

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Dec 26, 2024

Project creation date

README Summary

<!-- markdownlint-disable first-line-h1 --> <!-- markdownlint-disable html --> <!-- markdownlint-disable no-duplicate-header -->

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Distributed TrainingLarge Language Model DevelopmentMixture of Experts ArchitectureModel ScalingTransformer Architecture

Tags

Distributed TrainingLarge Language Model DevelopmentMixture of Experts ArchitectureModel ScalingTransformer ArchitectureAiderAnthropic / ClaudeBenchmarkingClaudeContext EngineeringDeepSeekDistillationEvalsForkedHaystackHuggingFaceHumanEvalKV CacheLLM ServingLarge Language ModelsLlamaMMLUModel OptimizationOpenAIPlanning / CoTPyTorchPythonQuantizationQwenReinforcement LearningResearch / PapersSGLangSpeculative DecodingTensorRTTransformersvLLM

Taxonomy

AI Trends

Mixture of ExpertsLarge Language ModelsModel Scaling

category

Foundation ModelsAI AgentsModel TrainingEvals & BenchmarkingInference & ServingLearning Resources

Deployment Context

Self-hostedCloud API

Modalities

Text

Skill Areas

Mixture of Experts ArchitectureLarge Language Model DevelopmentTransformer ArchitectureModel ScalingDistributed Training

tag

AiderAnthropic / ClaudeBenchmarkingClaudeContext EngineeringDeepSeekDistillationEvalsForkedHaystackHuggingFaceHumanEvalKV CacheLLM ServingLarge Language ModelsLlamaMMLUModel OptimizationOpenAIPlanning / CoTPyTorchPythonQuantizationQwenReinforcement LearningResearch / PapersSGLangSpeculative DecodingTensorRTTransformersvLLM

Use Cases

Natural Language GenerationText UnderstandingConversational AILanguage Model Research

Recent Activity

Updated 9 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
low
Maturity
research

Categories

Evals & BenchmarkingPrimaryInference & ServingLearning ResourcesFoundation ModelsAI AgentsModel TrainingCoding & Dev ToolsSearch & KnowledgeOther AI / ML

PM Skills

Cost & EfficiencyData & EvaluationAI-Native Architecture

Languages

Python100.0%

Timeline

Project created
Dec 26, 2024
Forked
Mar 13, 2026
Your last push
9 months ago
Upstream last push
9 months ago
Tracked since
Aug 28, 2025

Similar Repos

pgvector cosine similarity · $0

Loading…