gamegoblin (u/gamegoblin)

gamegoblin commented on GPT‑5.3‑Codex‑Spark openai.com/index/introduc... · Posted by u/meetpateltech

nikkwong · 2 days ago

I have a hard time understanding how that would work — for me, I typically interface with coding agents through cursor. The flow is like this: ask it something -> it works for a min or two -> I have to verify and fix by asking it again; etc. until we're at a happy place with the code. How do you get it to stop from going down a bad path and never pulling itself out of it?

The important role for me, as a SWE, in the process, is verify that the code does what we actually want it to do. If you remove yourself from the process by letting it run on its own overnight, how does it know it's doing what you actually want it to do?

Or is it more like with your usecase—you can say "here's a failing test—do whatever you can to fix it and don't stop until you do". I could see that limited case working.

gamegoblin · 2 days ago

I use Codex CLI or Claude Code

I don't even necessarily ask it to fix the bug — just identify the bug

Like if I've made a change that is causing some unit test to fail, it can just run off and figure out where I made an off-by-one error or whatever in my change.

gamegoblin commented on GPT‑5.3‑Codex‑Spark openai.com/index/introduc... · Posted by u/meetpateltech

nikkwong · 2 days ago

> Our latest frontier models have shown particular strengths in their ability to do long-running tasks, working autonomously for hours, days or weeks without intervention.

I have yet to see this (produce anything actually useful).

gamegoblin · 2 days ago

I routinely leave codex running for a few hours overnight to debug stuff

If you have a deterministic unit test that can reproduce the bug through your app front door, but you have no idea how the bug is actually happening, having a coding agent just grind through the slog of sticking debug prints everywhere, testing hypotheses, etc — it's an ideal usecase

gamegoblin commented on Amazon cuts 16k jobs reuters.com/legal/litigat... · Posted by u/DGAP

int_19h · 16 days ago

"AI detectors" are notoriously unreliable.

Perhaps more importantly here, when it comes to writing, "AI slop" is basically management speak - it's all about waxing poetically about simple things in ways that make you sound complicated (and useful!). And this guy is a career manager. So I bet this is actually human slop, the kind from which ChatGPT et al learned to speak the way they do.

gamegoblin · 16 days ago

AI detectors in general are unreliable, but there are a few made by serious researchers that have only 1-in-10000 false positive rate, e.g. https://arxiv.org/pdf/2402.14873

Having worked in a bigcorp, I've read my fair share of management-speak, and none of it sounds quite as empty as the allegedly AI text.

The AI sounds like someone conjuring a parody emulation of management speak instead of actual management speak.

More broadly — and I feel this way about AI code at well as AI prose — I find that part of my brain is always trying to reverse engineer what kind of person wrote this, what was their mental state when writing it?

And when reading AI code or AI prose, this part of my brain short circuits a little. Because there is no cohesive human mind behind the text.

It's kind of like how you subconsciously learn to detect emotion in tiny facial movements, you also subconsciously learn to reverse engineer someone's mind state from their writing.

Reading AI writing feels like watching an alien in skinsuit try to emulate human face emotional cues — it's just not quite right in a hard-to-describe-but-easy-to-detect way.

gamegoblin commented on Amazon cuts 16k jobs reuters.com/legal/litigat... · Posted by u/DGAP

tclancy · 17 days ago

If this is AI slop as the knee jerk comments next to me suggest, it’s goin to be a hell of a surprise if he gets elected this year! https://www.nleeplumb.com/about

gamegoblin · 17 days ago

While reading the text, my mental AI alarm bells were going off, sent it all to pangram.com and it flags both the layoff post and his campaign website text as being 100% AI generated

gamegoblin commented on Open-source Zig book zigbook.net... · Posted by u/rudedogg

ants_everywhere · 3 months ago

Keep in mind that pangram flags many hand-written things as AI.

> I just ran excerpts from two unpublished science fiction / speculative fiction short stories through it. Both came back as ai with 99.9% confidence. Both stories were written in 2013.

> I've been doing some extensive testing in the last 24 hours and I can confidently say that I believe the 1 in 10,000 rate is bullshit. I've been an author for over a decade and have dozens of books at hand that I can throw at this from years prior to AI even existing in anywhere close to its current capacity. Most of the time, that content is detected as AI-created, even when it's not.

> Pangram is saying EVERYTHING I have hand written for school is AI. I've had to rewrite my paper four times already and it still says 99.9% AI even though I didn't even use AI for the research.

> I've written an overview of a project plan based on a brief and, after reading an article on AI detection, I thought it would be interesting to run it through AI detection sites to see where my writing winds up. All of them, with the exception of Pangram, flagged the writing as 100% written by a human. Pangram has "99% confidence" of it being written by AI.

I generally don't give startups my contact info, but if folks don't mind doing so, I recommend running pangram on some of their polished hand written stuff.

https://www.reddit.com/r/teachingresources/comments/1icnren/...

gamegoblin · 3 months ago

Weird to me that nobody ever posts the actual alleged false positive text in these criticisms

I've yet to see a single real Pangram false positive that was provably published when it says it was, yet plenty such comments claiming they exist

gamegoblin commented on Open-source Zig book zigbook.net... · Posted by u/rudedogg

geysersam · 3 months ago

Clearly your perception of what is AI generated is wrong. You can't tell something is AI generated only because it uses "not just X - Y" constructions. I mean, the reason AI text often uses it is because it's common in the training material. So of course you're going to see it everywhere.

gamegoblin · 3 months ago

I sent the text through an AI detector with 0.1% false positive rate and it was highly confident the Zig book introduction was fully AI-written

gamegoblin commented on Open-source Zig book zigbook.net... · Posted by u/rudedogg

fuzzy_biscuit · 3 months ago

Why does this feel like an ad? I've seen pangram mentioned a few times now, always with that tagline. It feels like a marketing department skulking around comments.

gamegoblin · 3 months ago

The other pangram mention elsewhere in this comment section is also me -- I'm totally unaffiliated with them, just a fan of their tool

I specify the accuracy and false positive rate because otherwise skeptics in comment sections might otherwise think it's one of the plethora of other AI detection tools that don't really work

gamegoblin commented on Open-source Zig book zigbook.net... · Posted by u/rudedogg

skor · 3 months ago

how do you know this? let us know please, thanks. edit, I see you used this to check: https://news.ycombinator.com/item?id=45948220

gamegoblin · 3 months ago

pangram.com, the most accurate and lowest false positive AI detector

https://www.pangram.com/blog/third-party-pangram-evals

gamegoblin commented on Open-source Zig book zigbook.net... · Posted by u/rudedogg

popcar2 · 3 months ago

The first page says none of the book was written by AI

gamegoblin · 3 months ago

Yes, it's a false claim

gamegoblin commented on Open-source Zig book zigbook.net... · Posted by u/rudedogg

gigatexal · 3 months ago

there's no way someone made this for free, where do I donate? im gonna get so much value from this this feels like stealing

gamegoblin · 3 months ago

It's AI-written FWIW

though maybe AI is getting to the point it can do stuff like this somewhat decently