Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/awesome-ml-model-compression
Library/awesome-ml-model-compressionForked

cedrickchee/awesome-ml-model-compression

awesome-ml-model-compression

Awesome machine learning model compression research papers, quantization, tools, and learning material.

View on GitHub↗Upstream cedrickchee/awesome-ml-model-compression↗

Builder

cedrickchee

cedrickchee

cedrickchee • individual

Stars

543

Using upstream star count

Forks

63

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Dec 6, 2018

Project creation date

README Summary

Awesome ML Model Compression [![Awesome](https://awesome.re/badge.svg)](https://awesome.re)

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Deep Learning Architecture DesignEfficient Neural NetworksHardware-Aware Model DesignKnowledge DistillationModel CompressionModel OptimizationNeural Network QuantizationPruning Techniques

Tags

Deep Learning Architecture DesignEfficient Neural NetworksHardware-Aware Model DesignKnowledge DistillationModel CompressionModel OptimizationNeural Network QuantizationPruning TechniquesAI SafetyC++Computer VisionDeep LearningDeepSpeedDistillationEmbeddingsEvalsFine-TuningForkedGPU / CUDAHuggingFaceLarge Language ModelsLoRA / PEFTMachine LearningMobileOpen SourceOpenAIQuantizationReal-Time / StreamingResearch / PapersSpeech to TextTensorFlowTransformersTutorialllama.cpp

Taxonomy

AI Trends

On-device AISmall Language ModelsEfficient AIGreen AIEdge AI

category

Foundation ModelsRAG & RetrievalModel TrainingEvals & BenchmarkingInference & ServingGenerative MediaComputer VisionLearning ResourcesSecurity & Safety

Deployment Context

Edge/MobileOn-premiseCloud APIEmbedded Systems

Modalities

TextImageAudioVideoMultimodal

Skill Areas

Model CompressionNeural Network QuantizationKnowledge DistillationPruning TechniquesModel OptimizationDeep Learning Architecture DesignEfficient Neural NetworksHardware-Aware Model Design

tag

AI SafetyC++Computer VisionDeep LearningDeepSpeedDistillationEmbeddingsEvalsFine-TuningForkedGPU / CUDAHuggingFaceLarge Language ModelsLoRA / PEFTMachine LearningMobileModel OptimizationOpen SourceOpenAIQuantizationReal-Time / StreamingResearch / PapersSpeech to TextTensorFlowTransformersTutorialllama.cpp

Use Cases

Mobile AI Application DevelopmentEdge Computing Model DeploymentResource-Constrained InferenceReal-time Model ServingIoT Device AI IntegrationBattery-Efficient AI Systems

Recent Activity

Updated 1 years ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
medium
Maturity
research

Categories

RAG & RetrievalPrimaryEvals & BenchmarkingInference & ServingLearning ResourcesSecurity & SafetyFoundation ModelsModel TrainingGenerative MediaComputer VisionSafety & AlignmentEdge & Mobile AISearch & KnowledgeOther AI / ML

PM Skills

Cost & EfficiencySafety & AlignmentUser ExperienceScale & ReliabilityData & EvaluationProduct Discovery

Languages

No language breakdown recorded.

Timeline

Project created
Dec 6, 2018
Forked
Mar 23, 2026
Your last push
1 years ago
Upstream last push
1 years ago
Tracked since
Sep 21, 2024

Similar Repos

pgvector cosine similarity · $0

Loading…