Library/impForked

kekzl/imp

imp

High-performance LLM inference engine in C++/CUDA for NVIDIA Blackwell GPUs (RTX 5090)

View on GitHub↗Upstream kekzl/imp↗

Builder

kekzl

kekzl • individual

Stars

Using upstream star count

Forks

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Feb 23, 2026

Project creation date

README Summary

Community Evaluation

Loading…

AI Dev Skills

Unmapped

CUDA ProgrammingGPU ComputingGPU Kernel DevelopmentLLM Inference OptimizationLow-level Systems ProgrammingMemory ManagementNeural Network Acceleration

Recent Activity

Updated 2 months ago

7 Days

30 Days

90 Days

docs: update benchmarks to v0.3, improve quickstart

Raphael Friedmann • Mar 16, 2026

d508b5e

perf: micro-optimizations across hot paths (Sprint 4)

Raphael Friedmann • Mar 15, 2026

4835c47

perf: single-token sampling fast path + sample_single_from_logits

Raphael Friedmann • Mar 15, 2026

4a7e86a

Quality

prototype

Quality: low
Maturity: prototype

PM Skills

Cost & EfficiencyUser ExperienceScale & ReliabilityData & EvaluationDeveloper PlatformAI-Native Architecture

Languages

Cuda100.0%

Timeline

Project created: Feb 23, 2026
Forked: Mar 12, 2026
Your last push: 2 months ago
Upstream last push: 16 days ago
Tracked since: Mar 17, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…

Library/impForked

kekzl/imp

imp

High-performance LLM inference engine in C++/CUDA for NVIDIA Blackwell GPUs (RTX 5090)

View on GitHub↗Upstream kekzl/imp↗

Builder

kekzl

kekzl • individual

Stars

Using upstream star count

Forks

Using upstream fork count

Open Issues

Activity Score

0/100

0 commits in 30d

Created

Feb 23, 2026

Project creation date

README Summary

Community Evaluation

Loading…

AI Dev Skills

Unmapped

CUDA ProgrammingGPU ComputingGPU Kernel DevelopmentLLM Inference OptimizationLow-level Systems ProgrammingMemory ManagementNeural Network Acceleration

Recent Activity

Updated 2 months ago

7 Days

30 Days

90 Days

docs: update benchmarks to v0.3, improve quickstart

Raphael Friedmann • Mar 16, 2026

d508b5e

perf: micro-optimizations across hot paths (Sprint 4)

Raphael Friedmann • Mar 15, 2026

4835c47

perf: single-token sampling fast path + sample_single_from_logits

Raphael Friedmann • Mar 15, 2026

4a7e86a

Quality

prototype

Quality: low
Maturity: prototype

PM Skills

Cost & EfficiencyUser ExperienceScale & ReliabilityData & EvaluationDeveloper PlatformAI-Native Architecture

Languages

Cuda100.0%

Timeline

Project created: Feb 23, 2026
Forked: Mar 12, 2026
Your last push: 2 months ago
Upstream last push: 16 days ago
Tracked since: Mar 17, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…

imp

README Summary

Community Evaluation

AI Dev Skills

Tags

Taxonomy

Recent Activity

Quality

Categories

PM Skills

Languages

Timeline

Similar Repos

imp

README Summary

Community Evaluation

AI Dev Skills

Tags

Taxonomy

Recent Activity

Quality

Categories

PM Skills

Languages

Timeline

Similar Repos