Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/BitNet
Library/BitNetForked

microsoft/BitNet

BitNet

Official inference framework for 1-bit LLMs

View on GitHub↗Upstream microsoft/BitNet↗

Builder

Microsoft

Microsoft

microsoft • big-tech

Stars

39,120

Using upstream star count

Forks

3,569

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Aug 5, 2024

Project creation date

README Summary

bitnet.cpp [![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://opensource.org/licenses/MIT) ![version](https://img.shields.io/badge/version-1.0-blue)

Community Evaluation

Loading…

AI Dev Skills

Unmapped

1-bit Weight QuantizationEfficient Deep LearningLarge Language Model InferenceLow-precision Neural NetworksMemory-efficient ComputingModel CompressionNeural Network QuantizationTransformer Architecture

Tags

1-bit Weight QuantizationEfficient Deep LearningLarge Language Model InferenceLow-precision Neural NetworksMemory-efficient ComputingModel CompressionNeural Network QuantizationTransformer ArchitectureBenchmarkingC++EmbeddingsEvalsForkedHuggingFaceLlamaPrompt EngineeringPythonQuantizationResearch / PapersTransformersllama.cpp

Taxonomy

AI Trends

Model CompressionEfficient AIOn-device AIQuantized Neural NetworksResource-efficient LLMs

category

Foundation ModelsAI AgentsRAG & RetrievalEvals & BenchmarkingLearning Resources

Deployment Context

Edge/MobileSelf-hostedOn-premise

Modalities

Text

Skill Areas

Neural Network QuantizationLarge Language Model Inference1-bit Weight QuantizationModel CompressionEfficient Deep LearningLow-precision Neural NetworksTransformer ArchitectureMemory-efficient Computing

tag

BenchmarkingC++EmbeddingsEvalsForkedHuggingFaceLlamaPrompt EngineeringPythonQuantizationResearch / PapersTransformersllama.cpp

Use Cases

Resource-constrained LLM DeploymentEdge AI Language ProcessingMemory-efficient Text GenerationLow-power Language Model Inference

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

3

Update demo link in README.md

Yan Xia • Mar 10, 2026

01eb415

Merge pull request #421 from microsoft/fix/unsafe-deserialization-gpu-pipeline

tsong-ms • Mar 9, 2026

0fdaa16

fix: add weights_only=True to torch.load in GPU inference pipeline

Ubuntu • Mar 9, 2026

eb60fc3

Quality

research
Quality
medium
Maturity
research

Categories

RAG & RetrievalPrimaryEvals & BenchmarkingLearning ResourcesFoundation ModelsAI AgentsSearch & Knowledge

PM Skills

Cost & EfficiencyData & EvaluationProduct Discovery

Languages

Python100.0%

Timeline

Project created
Aug 5, 2024
Forked
Mar 12, 2026
Your last push
2 months ago
Upstream last push
2 months ago
Tracked since
Mar 10, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…