llama2.c

Inference Llama 2 in one file of pure C

View on GitHub↗Upstream karpathy/llama2.c↗

Builder

karpathy

karpathy • individual

Stars

19,347

Using upstream star count

Forks

2,485

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Jul 23, 2023

Project creation date

README Summary

A minimal implementation of Llama 2 inference written in pure C, contained in a single file with no dependencies. The project aims to demonstrate how to run Llama 2 models efficiently using only standard C libraries. It provides a lightweight alternative to heavy Python frameworks for running Llama 2 inference.

AI Dev Skills

Unmapped

Transformer ArchitectureLanguage Model InferenceModel OptimizationSystems ProgrammingNeural Network ImplementationQuantization TechniquesMemory Management

Recent Activity

Updated 1 years ago

7 Days

30 Days

90 Days

Quality

prototype

Quality: medium
Maturity: prototype

PM Skills

Developer Platform

Languages

C100.0%

Timeline

Project created: Jul 23, 2023
Forked: Mar 13, 2026
Your last push: 1 years ago
Upstream last push: 1 years ago
Tracked since: Aug 6, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…