Readit News
insin commented on From GPT-4 to GPT-5: Measuring progress through MedHELM [pdf]   fertrevino.com/docs/gpt5_... · Posted by u/fertrevino
vexna · 2 days ago
kool aid or not -- "reasoning" is already part of the LLM verbiage (e.g. `reasoning` models having `reasoningBudget`). The meaning might not be 1:1 to human reasoning, but when the LLM shows its "reasoning" it does _appear_ like a train of thought. If I had to give what it's doing a name (like I'm naming a function), I'd be hard pressed to not go with something like `reason`/`think`.
insin · 2 days ago

    prefillContext()

insin commented on Vibe Coding Is the Worst Idea of 2025 [video]   youtube.com/watch?v=1A6uP... · Posted by u/tomwphillips
bitpush · 4 days ago
Here's how the Innovator's Dilemma plays out.

Step 1: Some upstarts create a new way of doing something. It’s clunky and unrefined.

Step 2: "Experts" and senior folks in the field dismiss it as a "toy." It doesn't follow their established rules or best practices and seems amateurish. They wouldn't recommend it to anyone serious.

Step 3: The "toy" gets adopted by a small group of outsiders or newcomers who aren't burdened by the "right way" of doing things. They play with it, improve it, and find new applications for it.

Step 4: The "toy" becomes so effective and widespread that it becomes the new standard. The original experts are left looking out of touch, their deep knowledge now irrelevant to the new way of doing things.

We're at step 2, bordering on 3.

* Executives at Nokia and BlackBerry saw the first iPhone, with its lack of a physical keyboard, as an impractical toy for media consumption, not a serious work device.

* Professional photographers viewed the first low-resolution digital cameras as flimsy gadgets, only for them to completely decimate the film industry.

insin · 4 days ago
insin commented on "Remove mentions of XSLT from the html spec"   github.com/whatwg/html/pu... · Posted by u/troupo
BoiledCabbage · 5 days ago
So if I'm reading the two threads correctly, Google essentially asked for feedback, essentially all the feedback said "no, please don't", and they said "thanks for the feedback, we're gonna do it anyway!"?

The other suggestions seemed to be "if this is about security, then fund the OSS project, or swap to a newer, safer library, or pull it into the JS sandbox and ensure support is maintained", which were all mostly ignored.

And "if this is about adoption, then listen to the constant community request to update to the newer XSLT 3.0, which has been out for years and would have much higher adoption due to tons of QoL improvements including handling JSON."

And the argument presented, which I don't know (but seems reasonable to me), is that XSLT supports the open web. Google tried to kill it a decade ago, the community pushed back and stopped it. So Google's plan was to refuse to do anything to support it, ignore community requests for simple improvements, try to make it wither, then use that as justification for killing it at a later point.

Forcing this through when almost all feedback is against it seems to support that to me. Especially with XSLT suddenly/recently gaining a lot of popularity, it seems like they are trying to kill it before they have an open competitor on the web.

https://github.com/whatwg/html/issues/11523

insin · 5 days ago
Google tells you what they're going to do to the web with a question mark on the end.
insin commented on Show HN: I built an app to block Shorts and Reels   scrollguard.app/... · Posted by u/adrianhacar
anoojb · 6 days ago
Is there an alternate web client for iOS that's only videos from channels you've subscribed to?
insin · 6 days ago
Control Panel for YouTube lets you hide Home, Shorts, Related videos, plus anything you don't want to see in Subscriptions (including videos you've already watched), plus you can disable autoplay of random videos, block ads and more, in YouTube itself.

https://soitis.dev/control-panel-for-youtube

insin commented on Web apps in a single, portable, self-updating, vanilla HTML file   hyperclay.com/... · Posted by u/pil0u
pjmlp · 6 days ago
Someone rediscovered Windows 98 HTA archives.

https://en.wikipedia.org/wiki/HTML_Application

insin · 6 days ago
The original Electron! Pity the versions of IE it was available for were still hell to debug in.
insin commented on When did AI take over Hacker News?   zachperk.com/blog/when-di... · Posted by u/zachperkel
jacquesm · 7 days ago
I'm more annoyed at the - clearly - AI based comments than the articles themselves. The articles are easy to ignore, the comments are a lot harder. In light of that I'd still love it if HN created an ignore feature, I think the community is large enough now that that makes complete sense. It would certainly improve my HN experience.
insin · 7 days ago
I added muting and annotating users to my Hacker News extension:

https://soitis.dev/comments-owl-for-hacker-news

insin commented on Node.js is able to execute TypeScript files without additional configuration   nodejs.org/en/blog/releas... · Posted by u/steren
ryuuseijin · 7 days ago
I'm using tsx for a project to achieve the same effect. As you said, it saves you from having to set up a build/transpilation step, which is very useful for development. Tsx has a --watch feature built in as well, which allows me to run a server from the TypeScript source files and automatically restart on changes. Maybe with nodemon and this new Node improvement this can now be done without tsx.

To check types at runtime (if that can even be done in a useful way?) it would have to be built into v8, and I suppose that would be a whole rewrite.

insin · 7 days ago
Node has had a built-in --watch flag for a while too:

https://nodejs.org/docs/latest/api/cli.html#--watch

insin commented on OpenAI Progress   progress.openai.com... · Posted by u/vinhnx
isaacremuant · 8 days ago
Except people use the same examples like blueberry and strawberry, which were used months ago, as if they're current.

These models can also call Counter from Python's collections library or whatever other algorithm. Or are we claiming it should be a pure LLM, as if that's what we use in the real world?

I don't get it, and I'm not one to hype up LLMs since they're absolutely faulty, but the fixation over this example screams of lack of use.

insin · 8 days ago
It's the most direct way to break the "magic computer" spell in users of all levels of understanding and ability. You stand it up next to the marketing deliberately laden with keywords related to human cognition, intended to induce the reader to anthropomorphise the product, and it immediately makes it look as silly as it truly is.

I work on the internal LLM chat app for a F100, so I see users who need that "oh!" moment daily. When this did the rounds again recently, I disabled our code execution tool which would normally work around it and the latest version of Claude, with "Thinking" toggled on, immediately got it wrong. It's perpetually current.
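
For anyone following along, the code-execution workaround mentioned in this thread might look roughly like the sketch below. It uses `Counter` from Python's `collections` module, as the parent comment suggests; the helper name `count_letter` is made up for illustration and is not taken from any particular chat app or tool.

```python
from collections import Counter


def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word.

    Hypothetical helper: a stand-in for what a code execution tool
    could run instead of having the model count characters itself.
    """
    # Counter maps each character to its frequency; missing keys yield 0.
    return Counter(word.lower())[letter.lower()]


if __name__ == "__main__":
    print(count_letter("strawberry", "r"))
    print(count_letter("blueberry", "b"))
```

The counting happens in ordinary code, not in the model, which is why turning the tool off exposes the underlying behaviour.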

insin commented on Best Practices for Building Agentic AI Systems   userjot.com/blog/best-pra... · Posted by u/vinhnx
Ros23 · 8 days ago
"no-bullshit write up" about Agentic AI ... LOL
insin · 8 days ago
I've never seen so many different names at once for "LLM chat completion API call"

u/insin

Karma: 1490 · Cake day: January 30, 2009
About
jonathan.buchanan@gmail.com / @jbscript.dev on the butterfly place / personal: https://jbscript.dev / biz: https://soitis.dev

You know, I'm something of a browser extension creator myself

Control Panel for Twitter improves Twitter somewhat: https://soitis.dev/control-panel-for-twitter

Control Panel for YouTube improves YouTube somewhat: https://soitis.dev/control-panel-for-youtube

Comments Owl for Hacker News makes it actually possible to follow chat threads here over time: https://soitis.dev/comments-owl-for-hacker-news

AI Overview Hider for Google: https://soitis.dev/ai-overview-hider-for-google
