Library/llama2.c
Library/llama2.cForked

karpathy/llama2.c

llama2.c

Inference Llama 2 in one file of pure C

Builder

karpathy

karpathy

karpathy • individual

Stars

19,347

Using upstream star count

Forks

2,485

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jul 23, 2023

Project creation date

README Summary

A minimal implementation of Llama 2 inference written in pure C, contained in a single file with no dependencies. The project aims to demonstrate how to run Llama 2 models efficiently using only standard C libraries. It provides a lightweight alternative to heavy Python frameworks for running Llama 2 inference.

AI Dev Skills

Unmapped

Transformer ArchitectureLanguage Model InferenceModel OptimizationSystems ProgrammingNeural Network ImplementationQuantization TechniquesMemory Management

Tags

Transformer ArchitectureLanguage Model InferenceModel OptimizationSystems ProgrammingNeural Network ImplementationQuantization TechniquesMemory ManagementMinimal AI InfrastructureEdge/MobileOn-premiseEducational AI ImplementationTextIoTSmall Language ModelsEdge ComputingMinimal AI InferenceSelf-hostedResearch PrototypingOn-device AIEducationEmbedded Language ProcessingLow-resource Text GenerationEmbedded SystemsC

Taxonomy

Recent Activity

Updated 1 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

prototype
Quality
medium
Maturity
prototype

Categories

Other AI / MLPrimaryInference & ServingML Platform & InfrastructureEdge & Mobile AISearch & KnowledgeDev Tools & AutomationLearning ResourcesFoundation Models

PM Skills

Developer Platform

Languages

C100.0%

Timeline

Project created
Jul 23, 2023
Forked
Mar 13, 2026
Your last push
1 years ago
Upstream last push
1 years ago
Tracked since
Aug 6, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…