Readit News logoReadit News
Miyamura80 commented on How the 'Lethal Trifecta' sets the conditions for stealing data on command   scworld.com/perspective/h... · Posted by u/forks
Miyamura80 · 14 hours ago
In case there is an interest for solution to this, check out open.edison.watch and the accompanying blog posts on the README there.
Miyamura80 commented on Adversarial poetry as a universal single-turn jailbreak mechanism in LLMs   arxiv.org/abs/2511.15304... · Posted by u/capgre
fourthark · a month ago
True, you have to add guardrails outside the LLM.

Very tricky, though. I’d be curious to hear your response to simonw’s opinion on this.

Miyamura80 · 16 days ago
Sorry not familiar with this. Can you please link me?
Miyamura80 commented on Adversarial poetry as a universal single-turn jailbreak mechanism in LLMs   arxiv.org/abs/2511.15304... · Posted by u/capgre
fourthark · a month ago
Yes that’s the point, you can’t protect against that, so you shouldn’t construct the “lethal trifecta”

https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/

Miyamura80 · a month ago
You actually can protect against it, by tracking context entering/leaving the LLM, as long as its wrapped in a MCP gateway with trifecta blocker.

We've implemented this in open.edison.watch

u/Miyamura80

KarmaCake day1February 10, 2022View Original