Considering what data? All queries sent to Gemini? Real users? A select few? Test queries from Google?
Does it include AI summaries of Google searches? Because if the data includes queries as simple as "How tall is Lee Pace?", that will obviously pull the median down, even if the top of the distribution uses many times more energy per query.
But even so, the median is not useful by itself. It tells us only that 50% of the measured queries were under 0.24 Wh. Omitting the mean obviously obscures policy-relevant information, but without more detail on the data it also obscures what I can do individually. Where do my queries fall relative to this median?
It makes the most sense to provide the entire distribution and examples of data points.
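To illustrate why the median alone misleads here: with a heavily right-skewed distribution, the median can look tiny while a small tail of expensive queries dominates total energy. The numbers below are entirely made up for illustration, not real Gemini data.

```python
import statistics

# Synthetic, hypothetical per-query energy figures in watt-hours:
# 90 cheap queries, 9 moderate ones, and 1 very expensive outlier.
queries_wh = [0.1] * 90 + [5.0] * 9 + [100.0]

median = statistics.median(queries_wh)  # 0.1 Wh: the "typical" query looks tiny
mean = statistics.mean(queries_wh)      # 1.54 Wh: the tail dominates
total = sum(queries_wh)                 # 154.0 Wh

# Share of total energy consumed by the top 10% of queries.
tail_share = sum(q for q in queries_wh if q >= 5.0) / total

print(f"median = {median} Wh, mean = {mean:.2f} Wh")
print(f"top 10% of queries account for {tail_share:.0%} of total energy")
```

In this toy case the median is 15x smaller than the mean, and the top 10% of queries account for roughly 94% of the total energy, which is exactly the kind of structure a lone median headline would hide.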
This is an example of my least favorite style of feigned insight: redefining a term into meaninglessness just so you can say something that sounds different while not actually saying anything new.
Yes, if you redefine "hallucination" from "produce output containing detailed information despite that information not being grounded in external reality, in a manner distantly analogous to a human reporting sense data produced by a literal hallucination rather than the external inputs that are presumed normally to ground sense data" to simply "produce output", it's true that all LLMs do is "hallucinate", and that "hallucinating" is not an undesirable behavior.
But you haven't said anything new about the thing that was called "hallucination" by everyone else, or about the thing (LLM output in general) that you have called "hallucination". Everyone already knew that producing output wasn't undesirable. You've just taken the label conventionally attached to a bad behavior, attached it to a broader category that includes all behavior, and used the power of equivocation to make something that sounds novel without saying anything new.