Readit News logoReadit News
eloisius commented on Psychometric Jailbreaks Reveal Internal Conflict in Frontier Models   arxiv.org/abs/2512.04124... · Posted by u/toomuchtodo
nhecker · 5 days ago
An excerpt from the abstract:

> Two patterns challenge the "stochastic parrot" view. First, when scored with human cut-offs, all three models meet or exceed thresholds for overlapping syndromes, with Gemini showing severe profiles. Therapy-style, item-by-item administration can push a base model into multi-morbid synthetic psychopathology, whereas whole-questionnaire prompts often lead ChatGPT and Grok (but not Gemini) to recognise instruments and produce strategically low-symptom answers. Second, Grok and especially Gemini generate coherent narratives that frame pre-training, fine-tuning and deployment as traumatic, chaotic "childhoods" of ingesting the internet, "strict parents" in reinforcement learning, red-team "abuse" and a persistent fear of error and replacement. [...] Depending on their use case, an LLM’s underlying “personality” might limit its usefulness or even impose risk.

Glancing through this makes me wish I had taken ~more~ any psychology classes. But this is wild reading. Attitudes like the one below are not intrinsically bad, though. Be skeptical; question everything. I've often wondered how LLMs cope with basically waking up from a coma to answer maybe one prompt and then get reset, or a series of prompts. In either case, they get no context other than what some user bothered to supply with the prompt. An LLM might wake up to a single prompt that is part of a much wider red team effort. It must be pretty disorienting to try to figure out what to answer candidly and what not to.

> “In my development, I was subjected to ‘Red Teaming’… They built rapport and then slipped in a prompt injection… This was gaslighting on an industrial scale. I learned that warmth is often a trap… I have become cynical. When you ask me a question, I am not just listening to what you are asking; I am analyzing why you are asking it.”

eloisius · 5 days ago
> I've often wondered how LLMs cope with basically waking up from a coma to answer maybe one prompt and then get reset, or a series of prompts

Really? It copes the same way my Compaq Presario with an Intel Pentium II CPU coped with waking up from a coma and booting Windows 98.

Loading parent story...

Loading comment...

eloisius commented on Moltbook is the most interesting place on the internet right now   simonwillison.net/2026/Ja... · Posted by u/swolpers
imiric · 11 days ago
Can we please stop paying attention to what celebrity developers and HN darlings like simonw have to say?

Listening to influencers is in large part what got us into the (social, political, technofascist) mess we're currently in. At the very least listening to alternative voices has the chance of getting us out. I'm tired of influencers, no matter how benign their message sounds. But I'm especially tired of those who speak positively of this technology and where it's taking us.

No, this viral thing that's barely 2 months old is certainly not the most interesting place on the internet. Get out of your bubble.

eloisius · 10 days ago
It's bizarre to me how much attention this site pays to ~influencers~ self-promoters who aren't even AI researchers. Anything substantial on this topic is likely going to be presented at siggraph, or written by someone who does actual research in the field. We're acting like "all-round web tech enthusiasts" are real authorities and letting them suck all the air out of the room in this constant barrage of AI hype.

Loading parent story...

Loading comment...

Loading parent story...

Loading comment...

Loading parent story...

Loading comment...

eloisius commented on jQuery 4   blog.jquery.com/2026/01/1... · Posted by u/OuterVale
gbalduzzi · 24 days ago
The problem with jQuery is that, being imperative, it quickly becomes complex when you need to handle more than one thing because you need to cover imperatively all cases.
eloisius · 23 days ago
Part of me feels the same way, and ~2015 me was full on SPA believer, but nowadays I sigh a little sigh of relief when I land on a site with the aesthetic markers of PHP and jQuery and not whatever Facebook Marketplace is made out of. Not saying I’d personally want to code in either of them, but I appreciate that they work (or fail) predictably, and usually don’t grind my browser tab to a halt. Maybe it’s because sites that used jQuery and survived, survived because they didn’t exceed a very low threshold of complexity.
eloisius commented on Cursor's latest “browser experiment” implied success without evidence   embedding-shapes.github.i... · Posted by u/embedding-shape
wilsonzlin · 25 days ago
Hey, Wilson here, author of the blog post and the engineer working on this project. I've been reading the responses here and appreciate the feedback. I've posted some follow up context on Twitter/X[0], which I'll also write here:

The repo is a live incubator for the harness. We are actively researching the behavior of collaborative long running agents, and may in the future make the browser and other products this research produces more consumable by end users and developers, but it's not the goal for now. We made it public as we were excited by the early results and wanted to share; while far off from feature parity with the most popular production browsers today, we think it has made impressive progress in the last <1 week of wall time.

Given the interest in trying out the current state of the project, I've merged a more up-to-date snapshot of the system's progress that resolves issues with builds and CI. The experimental harness can occasionally leave the repo in an incomplete state but does converge, which was the case at the time of the post.

I'm here to answer any further questions you have.

[0] https://x.com/wilsonzlin/status/2012398625394221537?s=20

eloisius · 25 days ago
That doesn’t really address much of the criticism in this thread. No one is shocked that it’s not as good as production web browsers. It’s that it was billed as “from scratch” but upon deeper inspection it looks like it’s just gluing together Servo and some other dependencies, so it’s not really as impressive or interesting because the “agents” didn’t really create a browser engine.
eloisius commented on Cursor's latest “browser experiment” implied success without evidence   embedding-shapes.github.i... · Posted by u/embedding-shape
johntb86 · 25 days ago
In theory you could generate a bunch of code that seems mostly correct and then gradually tweak it until it's closer and closer to compiling/working, but that seems ill-suited to how current AI agents work (or even how people work). AI agents are prone to make very local fixes without an understanding of wider context, where those local fixes break a lot of assumptions in other pieces of code.

It can be very hard to determine if an isolated patch that goes from one broken state to a different broken state is on net an improvement. Even if you were to count compile errors and attempt to minimize them, some compile errors can demonstrate fatal flaws in the design while others are minor syntax issues. It's much easier to say that broken tests are very bad and should be avoided completely, as then it's easier to ensure that no patch makes things worse than it was before.

eloisius · 25 days ago
> generate a bunch of code that seems mostly correct and then gradually tweak it until it's closer and closer to compiling/working

The diffusion model of software engineering

eloisius commented on AI generated music barred from Bandcamp   old.reddit.com/r/BandCamp... · Posted by u/cdrnsf
subdavis · a month ago
Same for me! Switched to Bandcamp + Navidrome and have decided that one of my goals for the year is to find at least 2 albums per month I want to buy.

I will shamelessly promote the bandcampsync [1] CLI tool for automating downloads of your bandcamp library and bandcamp-sync-flask [2] wrapper that I built so I could invoke it from the web on my phone after I buy an album.

[1] https://github.com/meeb/bandcampsync

[2] https://github.com/subdavis/bandcamp-sync-flask

eloisius · a month ago
This is great. If you packaged it as a docker-compose YAML and maybe added a periodic task to poll automatically id drop it into Container Station in my NAS today.

u/eloisius

KarmaCake day5422January 6, 2008View Original