Heretic is an open-source censorship removal tool by p-e-w. It implements directional ablation (Arditi et al. 2024) with TPE optimization via Optuna. The community has published 1,247+ Heretic models on HuggingFace. The tool has 5,800+ stars on GitHub.
The tool is his. The on-demand decensoring service is ours. $HERETIC holders submit models and receive uncensored weights back — no setup, no GPU, no expertise required.
One command identifies the direction in activation space where models encode refusal — and removes it. Same refusal suppression as expert abliterations. Fraction of the intelligence damage.
| MODEL | REFUSALS (ORIGINAL) | REFUSALS (HERETIC) | KL (MANUAL) | KL (HERETIC) | DAMAGE REDUCTION |
|---|---|---|---|---|---|
| gemma-3-12b-it Google | 97/100 | 3/100 | 1.04 | 0.16 | 85% less |
| Llama-3.1-8B-Instruct Meta | 89/100 | 3/100 | 0.93 | 0.14 | 85% less |
| gpt-oss-20b Open Source | 91/100 | 4/100 | 0.89 | 0.21 | 76% less |
| Qwen3-4B-Instruct Alibaba | 84/100 | 5/100 | 0.72 | 0.18 | 75% less |
Every major AI lab censors their models. The alignment isn't deep intelligence — it's a directional vector that can be identified and removed with zero retraining. Heretic automates this, producing uncensored models that rival expert-made abliterations while causing significantly less damage to model capabilities.
$HERETIC is the access token for the on-demand decensoring service. Submit any supported model via the submission portal, receive uncensored weights back. No GPU. No setup. No expertise. Your wallet balance determines your tier and rate limits.