nicolevin commented on DeepSeek's Hidden Bias: How We Cut It by 76% Without Performance Loss   hirundo.io/blog/deepseek-... · Posted by u/nicolevin
nicolevin · 7 months ago
Reach out to @nicilevv on X with questions.
nicolevin commented on DeepSeek's Hidden Bias: How We Cut It by 76% Without Performance Loss   hirundo.io/blog/deepseek-... · Posted by u/nicolevin
nicolevin · 7 months ago
DeepSeek-R1 (8B) exhibited 2x more bias than base Llama. We applied targeted unlearning and reduced bias by up to 76% across race, gender, and nationality while maintaining model performance (TruthfulQA: 9.8→9.9, LogiQA: 42.6%→42.5%). The process took ~1 hour on consumer hardware. The debiased model is available on HuggingFace.

u/nicolevin

Karma: 52 · Cake day: December 20, 2023