Readit News logoReadit News
sc077y commented on OpenAI are quietly adopting skills, now available in ChatGPT and Codex CLI   simonwillison.net/2025/De... · Posted by u/simonw
petetnt · 4 days ago
It’s impressive how every iteration tries to get further from pretending actual AGI would be anywhere close when we are basically writing library functions with the worst DSL known to man, markdown-with-english.
sc077y · 4 days ago
Who knew that English would be the most popular programming language of 2025?
sc077y commented on 73% of AI startups are just prompt engineering   pub.towardsai.net/i-rever... · Posted by u/kllrnohj
sc077y · 23 days ago
73% of statistics are wrong
sc077y commented on IQ tests results for AI   trackingai.org/home... · Posted by u/stared
gpt5 · 4 months ago
The way human IQ testing developed is that researchers noticed people who excel in one cognitive task tend to do well in others - the “positive manifold.”

They then hypothesized a general factor, “g,” to explain this pattern. Early tests (e.g., Binet–Simon; later Stanford–Binet and Wechsler) sampled a wide range of tasks, and researchers used correlations and factor analysis to extract the common component, then norm it around 100 with a SD of 15 and call it IQ.

IQ tend to meaningfully predicts performance across some domains especially education and work, and shows high test–retest stability from late adolescence through adulthood. It is also tend to be consistent between high quality tests, despite a wide variety of testing methods.

It looks like this site just uses human rated public IQ tests. But it would have been more interesting if an IQ test was developed specifically for AI. I.e. a test that would aim to Factor out the strength of a model general cognitive ability across a wide variety of tasks. It is probably doable by doing principal component analysis on a large set of benchmarks available today.

sc077y · 4 months ago
I believe the ARC-AGI benchmark fits that description, it's sort of an IQ test for LLMs, though I would caution against using the word "Intelligence" for LLMs.
sc077y commented on Steve Wozniak: Life to me was never about accomplishment, but about happiness   yro.slashdot.org/comments... · Posted by u/MilnerRoute
sc077y · 4 months ago
Woz is just a nerd, simple as that. And he stayed true to himself and that ethos his whole life.
sc077y commented on 6 weeks of Claude Code   blog.puzzmo.com/posts/202... · Posted by u/mike1o1
sc077y · 4 months ago
Every time you use these tools irresponsibly, for instance for what I like to call headless programming (vibe coding), understand that you are incurring tech debt. Not just in terms of your project but personal debt regarding what you SHOULD have learned in order to implement the solution.

It’s like using ChatGPT in high school: it can be a phenomenal tutor, or it can do everything for you and leave you worse off.

The general lesson from this is that Results ARE NOT everything.

sc077y commented on 6 weeks of Claude Code   blog.puzzmo.com/posts/202... · Posted by u/mike1o1
_l7dh · 4 months ago
Feels like the most valuable skill to have as a programmer in times of Claude Code is that of carefully reading spec documentation and having an acute sense of critical thinking when reviewing code.
sc077y · 4 months ago
Critical Skills is spotting the potential bugs before they happen but in order to do that you need to have an extremely acute understanding or a have a lot of experience in the stack, libs and programming language of choice. Something that, ironically, you will not get by "vibe coding".
sc077y commented on Uv: Running a script with dependencies   docs.astral.sh/uv/guides/... · Posted by u/Bluestein
Hackbraten · 5 months ago
Unless it can't because you happen to have mounted your user cache directory from a different volume in an attempt to debloat your hourly backups.
sc077y · 5 months ago
In that case, you use copy OR what you can can also do, if you really care about disk usage, is use symbolic links between the drives. have a .venv sym link on drive A (raid 1) point to the uv_cache_dir's venv on drive B (raid 0). I have not tested though what happens when you unmount and sync.
sc077y commented on Claude Code Router   github.com/musistudio/cla... · Posted by u/y1n0
sc077y · 5 months ago
I tried installing and setting up the project today, it was miserable. I finally got it to work only to find out that the mistral models' tool calling does not work at all for claude code. Also, there is no mention anywhere of what models actually support anthropic level tool calling. If anyone knows if there are some open weight models (deepseek or others) I can host on my infra to get this to work out of the box that would be amazing.
sc077y commented on Steve Jobs' cabinet   perfectdays23.substack.co... · Posted by u/padraigf
sc077y · 5 months ago
I recently saw an interview without someone on the mac team and what's interesting is that the original Mac team had a lot of friction because of this philosophy. Jobs constantly asked unreasonable design constraints of his engineering team, the team prepared two laptops one with the "esthetic" laptop and one with the pragmatic design. One vastly outperformed the other and Jobs conceded, this was a constant pattern for the engineering teams at Apple. What you see before you is not the esthetic version but the pragmatic version. Sometimes Jobs was right but most of the time he was delusional.
sc077y commented on Uv: Running a script with dependencies   docs.astral.sh/uv/guides/... · Posted by u/Bluestein
slightwinder · 5 months ago
> Save that as script.py and you can use "uv run script.py" to run it with the specified dependencies,

Be aware that uv will create a full copy of that environment for each script by default. Depending on your number of scripts, this could become wasteful really fast. There is a flag "--link-mode symlink" which will link the dependencies from the cache. I'm not sure why this isn't the default, or which disadvantages this has, but so far it's working fine for me, and saved me several gigabytes of storage.

sc077y · 5 months ago
By default it will create hard links for python packages, so it won't consume any more memory (besides the small overhead of hard links).

u/sc077y

KarmaCake day89January 2, 2023
About
Software Engineer, currently working in genAI.

I'm just a student of the game.

View Original