Library/heretic
Library/hereticForked

p-e-w/heretic

heretic

Fully automatic censorship removal for language models

Builder

p-e-w

p-e-w

p-e-w • individual

Stars

18,175

Using upstream star count

Forks

1,808

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Sep 21, 2025

Project creation date

README Summary

Heretic is a Python tool that automatically removes censorship and safety restrictions from language models by analyzing and modifying their behavior patterns. It works by identifying and neutralizing the mechanisms that cause models to refuse certain requests or topics. The tool aims to restore uncensored functionality to models that have been restricted through safety fine-tuning.

AI Dev Skills

Unmapped

Language Model Safety ResearchNeural Network ManipulationTransformer ArchitectureModel Inference OptimizationSafety Alignment ResearchAdversarial Machine Learning

Tags

Language Model Safety ResearchNeural Network ManipulationTransformer ArchitectureModel Inference OptimizationSafety Alignment ResearchAdversarial Machine LearningAI ResearchLanguage Model AlignmentTextAcademic ResearchSelf-hostedAI Alignment ResearchSafety ResearchLanguage Model UncensoringAdversarial AI ResearchAI SafetyResearch EnvironmentAdversarial Testing of Language ModelsPython

Taxonomy

Recent Activity

Updated 27 days ago

7 Days

0

30 Days

0

90 Days

0

Quality

research
Quality
medium
Maturity
research

Categories

Learning ResourcesPrimaryEvals & BenchmarkingInference & ServingSafety & AlignmentSearch & KnowledgeOther AI / MLFoundation ModelsRobotics

PM Skills

Product Discovery

Languages

Python100.0%

Timeline

Project created
Sep 21, 2025
Forked
Mar 13, 2026
Your last push
27 days ago
Upstream last push
6 days ago
Tracked since
Mar 17, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…