Readit News logoReadit News
MrCheeze commented on Busy beaver hunters reach numbers that overwhelm ordinary math   quantamagazine.org/busy-b... · Posted by u/defrost
wodenokoto · 4 days ago
Numberphile just did a video on subcubic graph numbers which grows much, much faster than Tree numbers.

Do we know if they grow faster than busy beavers?

https://youtu.be/4-eXjTH6Mq4

MrCheeze · 3 days ago
The busy beaver function is interesting precisely because you _can't_ come up with any computable function that grows faster.
MrCheeze commented on Claude says “You're absolutely right!” about everything   github.com/anthropics/cla... · Posted by u/pr337h4m
MrCheeze · 15 days ago
Claude almost universally reacts to everything with a positive exclamation as its first sentence, regardless of whether it's good or bad. If you don't believe me, just watch https://www.twitch.tv/claudeplayspokemon for about three minutes and you'll get the idea.

Alternatively, look at the system prompt, where Anthropic attempted to get it to stop doing this: > Claude never starts its response by saying a question or idea or observation was good, great, fascinating, profound, excellent, or any other positive adjective. It skips the flattery and responds directly. https://docs.anthropic.com/en/release-notes/system-prompts#a...

This problem seems highly specific to Claude. It's not exactly sycophancy so much as it is a strong bias towards this exact type of reaction to everything.

MrCheeze commented on Evolving OpenAI's Structure   openai.com/index/evolving... · Posted by u/rohitpaulk
blagie · 4 months ago
No, this only happens if:

1) You're successful.

2) You mess up checks-and-balances at the beginning.

OpenAI did both.

Personally, I think at some point, the AGs ought to take over and push it back into a non-profit format. OAI undermines the concept of a non-profit.

MrCheeze · 4 months ago
With 2, the real problem is that approximately 0% of the OpenAI employees actually believed in the mission. Pretty much every single one of them signed the letter to the board demanding that if the company's existence ever comes into conflict with humanity's survival, the company's existence comes first.
MrCheeze commented on Speedrunners are vulnerability researchers, they just don't know it yet   zetier.com/speedrunners-a... · Posted by u/chc4
tonetegeatinst · 6 months ago
I remember watching a video about this a while ago....it was a fresh perspective into a side of security research I didn't consider.
MrCheeze · 6 months ago
Are you thinking of Bismuth's "Speedrunning as a gateway to scientific endeavours", perhaps?

https://www.youtube.com/watch?v=w8_1lQ2KH50

MrCheeze commented on Speedrunners are vulnerability researchers, they just don't know it yet   zetier.com/speedrunners-a... · Posted by u/chc4
Boldened15 · 6 months ago
Since speedrunners who find glitches are obviously very technical, do they usually already have some sort of day job in tech? I imagine it might be easier and just as lucrative to work on some CRUD app 9-5 and devote the rest of their time to research/streaming, and may be preferable to overloading their brain with even more of the same kind of research.
MrCheeze · 6 months ago
As an n=1 data point, that was my exact situation for a while. Also a lot of the people who put out high effort stuff are college students, which works for the same reason.

More interestingly and more surprisingly, some of the people who work on exploiting games _don't_ do any sort of tech work and have no background in compsci - they're purely self educated just for the sole purpose of breaking the one game they're interested in. This was the case for some of the biggest contributors to ACE in Zelda Ocarina of Time.

MrCheeze commented on Speedrunners are vulnerability researchers, they just don't know it yet   zetier.com/speedrunners-a... · Posted by u/chc4
MrCheeze · 6 months ago
I've wondered myself why there's so little overlap between these two closely related interests of mine. Some of it seems to be the "But I don't want to cure cancer. I want to turn people into dinosaurs." effect, where some of the people working on exploiting games ONLY care about what can be done in their one game of interest - it doesn't always generalize to interest in using the same techniques against everything else.

Of course there's also the fact that exploiting 20-30 year old games is just vastly easier than modern software, due to the total lack of mitigations in them. And that's on top of the fact that with popular games, you're building on decades of reverse engineering work rather than (potentially) starting from scratch. And the arguably superior toolset (savestates etc).

But I think a very big factor is the one this blogpost is trying to address - most people just don't know anything at all about the vuln research industry, which is not exactly searching for attention in the ways that speedruns broadcast to hundreds of thousands of viewers for charity are.

MrCheeze commented on DeepSeek-R1   github.com/deepseek-ai/De... · Posted by u/meetpateltech
byteknight · 7 months ago
Disclaimer: I am very well aware this is not a valid test or indicative or anything else. I just thought it was hilarious.

When I asked the normal "How many 'r' in strawberry" question, it gets the right answer and argues with itself until it convinces itself that its (2). It counts properly, and then says to it self continuously, that can't be right.

https://gist.github.com/IAmStoxe/1a1e010649d514a45bb86284b98...

MrCheeze · 7 months ago
How long until we get to the point where models know that LLMs get this wrong, and that it is an LLM, and therefore answers wrong on purpose? Has this already happened?

(I doubt it has, but there ARE already cases where models know they are LLMs, and therefore make the plausible but wrong assumption that they are ChatGPT.)

MrCheeze commented on A liar who always lies says "All my hats are green."   theguardian.com/science/2... · Posted by u/ColinWright
chiakic · 9 months ago
The phrase “All my hats are green” implicitly carries the premise “I have hats” in conversational language. Thus, if this statement were to be expressed as a logical proposition, it should be: “I have hats, and all my hats are green.”

This means there are two potential falsehoods in the statement: 1. “I have hats.” 2. “All my hats are green.”

Therefore, what we can deduce is: • He might not have any hats. • If he does have hats, at least one of them is not green.

However, considering the options provided by the author, it is clear they did not take into account the implicitly stated proposition (1) in conversational language. Instead, the author assumes that if he has no hats, then “All my hats are green” is true.

This interpretation, however, is conversationally unreasonable; otherwise, one could claim something equally absurd, such as “All my houses are worth over 100 million dollars” but actually has no house.

MrCheeze · 9 months ago
In conversational language, "All my hats..." implies that the speaker has at least two hats, which theoretically means that the sentence could be a lie from them having exactly one hat (even a green one). However, in practice, I don't think anyone would actually call that a lie. I think we treat the "I have hats" part not as part of the sentence itself, but more as an underlying premise.

You could have a similar situation in pure logic or math - if I were to say "the largest prime number is odd", is that false? Or something else entirely? (This is what Hofstadter calls mu, from a related concept in Zen.)

MrCheeze commented on With fifth busy beaver, researchers approach computation's limits   quantamagazine.org/amateu... · Posted by u/LegionMammal978
DowsingSpoon · a year ago
Maybe I’m just an unsophisticated code monkey, but I read a little about Busy Beaver from time to time and I just don’t get it. Why is this an interesting problem? What do we hope to learn from it?

u/MrCheeze

KarmaCake day89July 15, 2023
About
I did this stuff: https://mrcheeze.github.io/
View Original