Readit News
yuvalr1 commented on Reasoning models reason well, until they don't   arxiv.org/abs/2510.22371... · Posted by u/optimalsolver
ftalbot · 2 months ago
Every token in a response has an element of randomness to it. This means they’re non-deterministic. Even if you set up something within their training data there is some chance that you could get a nonsense, opposite, and/or dangerous result. The chance of that may be low because of things being set up for it to review its result, but there is no way to make a non-deterministic answer fully bound to solving or reasoning anything assuredly, given enough iterations. It is designed to be imperfect.
yuvalr1 · 2 months ago
You are making a wrong leap from a non-deterministic process to an uncontrollable result. Most parallel algorithms are non-deterministic. There might be no guarantee about the order of calculation, or sometimes even about the final absolute result. However, even when it produces different final results, the algorithm can still guarantee characteristics of the result.

The hard problem, then, is not to eliminate non-deterministic behavior, but to find a way to control it so that it produces what you want.
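
To make the parallel-algorithm point concrete, here is a minimal sketch (not taken from the linked paper) of a floating-point sum whose order of additions is random, as it might be in an unordered parallel reduction. The function name `shuffled_sum`, the test data, and the deliberately loose bound `2 * n * u * sum(|x|)` are all assumptions chosen for illustration. The individual results may differ from run to run, yet every one of them stays within the error bound.

```python
import math
import random


def shuffled_sum(values, rng):
    """Add the values in a random order, mimicking an unordered reduction."""
    order = list(values)
    rng.shuffle(order)
    total = 0.0
    for v in order:
        total += v
    return total


rng = random.Random(0)
# Hypothetical data: values of very different magnitudes, so that the
# order of additions can affect the rounding of the final result.
values = [rng.uniform(-1.0, 1.0) * 10.0 ** rng.randint(-8, 8) for _ in range(10_000)]

exact = math.fsum(values)                  # correctly rounded reference sum
n = len(values)
u = 2.0 ** -53                             # unit roundoff for IEEE 754 doubles
sum_abs = math.fsum(abs(v) for v in values)
bound = 2 * n * u * sum_abs                # loose worst-case bound, valid for any order

results = [shuffled_sum(values, rng) for _ in range(20)]
within_bound = all(abs(r - exact) <= bound for r in results)
print("distinct results:", len(set(results)))
print("all within bound:", within_bound)
```

The order of additions is not controlled, but the guarantee about the result holds for every possible order.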

yuvalr1 commented on Microsoft only lets you opt out of AI photo scanning 3x a year   hardware.slashdot.org/sto... · Posted by u/dmitrygr
jonas21 · 2 months ago
I don't really see the issue. If you don't want the face recognition feature, then you'll turn it off once, and that's that. Maybe if you're unsure, you might turn it off, and then back on, and then back off again. But what's the use case where you'd want to do this more than 3x per year?

Presumably, it's somewhat expensive to run face recognition on all of your photos. When you turn it off, they have to throw away the index (they'd better be doing this for privacy reasons), and then rebuild it from scratch when you turn the feature on again.

yuvalr1 · 2 months ago
Well, sometimes Microsoft decides to change your settings back. This has happened to me very frequently after installing Windows updates. I remember finding myself turning the same settings off time and again.
yuvalr1 commented on What Does Consulting Do?   nber.org/papers/w34072... · Posted by u/surprisetalk
yuvalr1 · 4 months ago
There are two kinds of consultants: those who write code and those who only give advice. It seems to me that those who only advise have lost their market to LLMs almost completely.
yuvalr1 commented on Iran asks its people to delete WhatsApp from their devices   apnews.com/article/iran-w... · Posted by u/rdrd
yuvalr1 · 6 months ago
I'm surprised to read that the Iranian regime's concerns are centered on WhatsApp sharing information with Israel. It is much more likely that WhatsApp has 0-day vulnerabilities used by the Mossad to obtain the information than that WhatsApp is actively sharing it.
yuvalr1 · 6 months ago
> Iran banned WhatsApp and Google Play in 2022 during mass protests against the government

So more than fearing Israel, they actually fear the public that has an encrypted communication channel that can't be tapped by their police. Explains a lot.

yuvalr1 commented on Iran asks its people to delete WhatsApp from their devices   apnews.com/article/iran-w... · Posted by u/rdrd
cess11 · 6 months ago
If you want actual security, don't use a phone and run your own server, likely Matrix.
yuvalr1 · 6 months ago
Security strength is not a binary measure. There are many levels of security between "no encryption at all" and "run your own server".
yuvalr1 commented on Trapping misbehaving bots in an AI Labyrinth   blog.cloudflare.com/ai-la... · Posted by u/pabs3
momojo · 9 months ago
In this 'arms race', will this serve as an actual deterrent? Can anyone involved in scraping chime in?
yuvalr1 · 9 months ago
I am not involved in scraping, but to me this sounds like simply another tool in the arsenal. They say it's hard for the scraper to realize it has been caught this way because it's not being blocked. However, I don't see anything preventing scrapers from implementing heuristics to realize that.
yuvalr1 commented on Diagrams AI can, and cannot, generate   ilograph.com/blog/posts/d... · Posted by u/billyp-rva
diggan · 9 months ago
A mistake I see people repeating over and over is never restarting their conversations with an edited initial message.

Instead of doing what the author is doing here, sending messages back and forth, which leads to a longer and longer conversation where each message produces worse and worse replies until the LLM seems like a dumb rock, rewrite your initial message with everything that went wrong or was misunderstood. Aim to have whatever you want solved in the first message, and you'll get much higher quality answers. If the LLM misunderstood, don't reply "No, what I meant was..." but instead rewrite the first message so it's clearer.

This is at least true for all ChatGPT, Claude and DeepSeek models, YMMV with other models.

yuvalr1 · 9 months ago
This means the leading UI for LLMs - the chat - is the wrong UI, at least for some of the tasks. We should instead have a single query text field, like in search engines, that you continue to edit and refine, just like in complex search queries.
yuvalr1 commented on Big LLMs weights are a piece of history   antirez.com/news/147... · Posted by u/freeatnet
badlibrarian · 9 months ago
No. And every video game ever made is available for download as well. If you even have to download it: they pride themselves on making many of them playable in the browser with just a click.

Copyright issues aside (let's avoid that mess) I was referring to basic technical issues with the site. Design is atrocious, search doesn't work, you can click 50 captures of a site before you find one that actually loads, obvious data corruption, invented their own schema instead of using a standard one and don't enforce it, API is insane and usually broken, uploader doesn't work reliably, don't honor DMCA requests, ask for photo id and passports then leak them ...

It's the worst possible implementation of the best possible idea.

yuvalr1 · 9 months ago
And yet, it's the best we currently have. I donate to them. We can come up with demands about how it should be managed, but that should not prevent us from helping them.
yuvalr1 commented on About Google Chrome's "This extension may soon no longer be supported" (2024)   github.com/gorhill/uBlock... · Posted by u/0x000042
lnl · 9 months ago
The parent comment is talking about distractions, not ads. YouTube has plenty of those, even embedded YouTube videos, unless you pause the video before it ends. uBlock Origin Lite cannot block elements except through packaged rulesets, and while there are some ad-blocker lists that are meant to block annoyances on pages in addition to ads, everybody has a different idea on what is an annoyance on a webpage.
yuvalr1 · 9 months ago
Exactly. I was not referring to ads, but to annoying suggestions in embedded videos (also when the video is paused!), and even to the long and mostly useless suggestions list on the right of the screen. I want to use YouTube as a useful tool, not to waste my time in endless loops of "oh, that looks interesting!"

u/yuvalr1

Karma: 565 · Cake day: January 12, 2014