nicolevin (u/nicolevin)

nicolevin commented on DeepSeek's Hidden Bias: How We Cut It by 76% Without Performance Loss hirundo.io/blog/deepseek-... · Posted by u/nicolevin

nicolevin · 7 months ago

reach out at @nicilevv on X for questions

nicolevin commented on DeepSeek's Hidden Bias: How We Cut It by 76% Without Performance Loss hirundo.io/blog/deepseek-... · Posted by u/nicolevin

nicolevin · 7 months ago

Bias-Unlearned DeepSeek-R1-Distill-Llama-8B here: https://huggingface.co/hirundo-io/DeepSeek-R1-Distill-Llama-...

nicolevin commented on DeepSeek's Hidden Bias: How We Cut It by 76% Without Performance Loss hirundo.io/blog/deepseek-... · Posted by u/nicolevin

nicolevin · 7 months ago

DeepSeek-R1 (8B) exhibited 2x more bias than base Llama. We applied targeted unlearning, reduced bias by up to 76% across race/gender/nationality, while maintaining model performance (TruthfulQA: 9.8→9.9, LogiQA: 42.6%→42.5%). Done in ~1hr on consumer hardware. Debiased model on HuggingFace.

Posted by u/nicolevin 7 months ago

DeepSeek's Hidden Bias: How We Cut It by 76% Without Performance Loss hirundo.io/blog/deepseek-...

u/nicolevin

KarmaCake day52December 20, 2023View Original