bugglebeetle (u/bugglebeetle)

bugglebeetle commented on The inefficiency of RL, and implications for RLVR progress dwarkesh.com/p/bits-per-s... · Posted by u/cubefox

vessenes · 20 days ago

So grumpy! Please pick up the torch and educate the world better; it can only help.

bugglebeetle · 20 days ago

I teach and mentor lots of folks in my world. What I don’t do is feign expertise to rub shoulders with the people doing the actual work so I can soak money from rubes with ad rolls.

bugglebeetle commented on The inefficiency of RL, and implications for RLVR progress dwarkesh.com/p/bits-per-s... · Posted by u/cubefox

refulgentis · 20 days ago

Dwarkesh's blogging confuses me, because I am not sure if the message is free-associating or relaying information gathered.

ex. how this reads if it is free-associating: "shower thought: RL on LLMs is kinda just 'did it work or not?' and the answer is just 'yes or no', yes or no is a boolean, a boolean is 1 bit, then bring in information theory interpretation of that, therefore RL doesn't give nearly as much info as, like, a bunch of words in pretraining"

or

ex. how this reads if it is relaying information gathered: "A common problem across people at companies who speak honestly with me about the engineering side off the air is figuring out how to get more out of RL. The biggest wall currently is the cross product of RL training being slowww and lack of GPUs. More than one of them has shared with me that if you can crack the part where the model gets very little info out of one run, then the GPU problem goes away. You can't GPU your way out of how little info they get"

I am continuing to assume it is much more A than B, given your thorough-sounding explanation and my prior that he's not shooting the shit about specific technical problems off-air with multiple grunts.
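For reference, the "a boolean is 1 bit" step in reading A can be checked with a few lines of Python. This is only a sketch of the information-theory framing, not anything taken from the post itself: the function names and the 65,536-token vocabulary are hypothetical, and log2(vocab size) is a ceiling on what one token can carry, not a measurement of what a model actually learns from it.

```python
import math


def binary_entropy_bits(p: float) -> float:
    """Shannon entropy, in bits, of a pass/fail outcome that succeeds with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0  # a certain outcome carries no information
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def max_bits_per_token(vocab_size: int) -> float:
    """Upper bound, in bits, on the information in one token drawn from vocab_size options."""
    return math.log2(vocab_size)


if __name__ == "__main__":
    # Hypothetical pass rates for an RL task with a yes/no reward.
    for pass_rate in (0.5, 0.1, 0.01):
        bits = binary_entropy_bits(pass_rate)
        print(f"pass rate {pass_rate}: at most {bits:.3f} bits per episode")

    # Hypothetical vocabulary size, chosen only for illustration.
    print(f"one token: at most {max_bits_per_token(65536):.1f} bits")
```

The yes/no signal tops out at 1 bit when the pass rate is 50% and shrinks toward zero as the task becomes nearly always passed or nearly always failed, which is the core of the "RL gives very little info per run" argument.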

bugglebeetle · 20 days ago

Dwarkesh has a CS degree, but zero academic training or real world experience in deep learning, so all of his blogging is just secondhand bullshitting to further siphon off a veneer of expertise from his podcast guests.

bugglebeetle commented on Jakarta is now the biggest city in the world axios.com/2025/11/24/jaka... · Posted by u/skx001

bugglebeetle · 25 days ago

[flagged]

bugglebeetle commented on Jakarta is now the biggest city in the world axios.com/2025/11/24/jaka... · Posted by u/skx001

throwaway2037 · 25 days ago

In Japan/ese, the pitch/stress thing is overrated, and so are regional language differences. When natives point it out to me, it strikes me as little more than cultural gatekeeping. Linguistic context matters much more. How often are you listening to your own native language and you are confused by two words that sound similar (like 'hashi' in Japanese for bridge/chopsticks)? Almost never. Advice: Ignore it when natives criticise your pronunciation. Ask them how their German or Thai is... and they will freeze with shame.

Where I come from, to criticise a non-native speaker's accent or small grammatical errors (that do not impact the meaning) is a not-so-subtle form of discrimination. As a result, I never do it. (To criticise myself, it took many, many years to see this about my home culture and stop doing it myself.) Still, many people ask me: "Hey, can you correct my <language X> when I speak it?" "Sure!" (but I never do.)

bugglebeetle · 25 days ago

Japanese actually has a much smaller set of phonemes (~half as many as English), resulting in extensive homophones. When combined with its greater tendency toward ambiguity, correct use of pitch can actually have a larger impact on intelligibility, as compared to many other languages.

bugglebeetle commented on Ilya Sutskever: We're moving from the age of scaling to the age of research dwarkesh.com/p/ilya-sutsk... · Posted by u/piotrgrabowski

pxc · 25 days ago

I don't remember all of the details so I can't remember if that came up in the episode I listened to. But I did listen to an episode where he talked to a (Chinese) guest about China. I discussed it with a Chinese friend at the time, and we both thought the guest was very interesting and well-informed, but the interviewer's questions were sometimes fantastical in a paranoid way, naively ideological, and often even a bit stupid.

It being the first (and so far only) interview of his I'd seen, between that and the AI boosterism, I was left thinking he was just some overblown hack. Is this a blind spot for him so that he's sometimes worth listening to on other topics? Or is he in fact an overblown hack?

bugglebeetle · 25 days ago

No, he’s an overblown hack who is pandering to the elements of his audience that would share those views about Nazism and China. Should many someday see through the veil of his bullshit or simply grow tired of his pablum, he can then pivot to being a far right influencer and continue raking in the dough, having previously demonstrated the proper bona fides.

bugglebeetle commented on Ilya Sutskever: We're moving from the age of scaling to the age of research dwarkesh.com/p/ilya-sutsk... · Posted by u/piotrgrabowski

Havoc · 25 days ago

There’s no way that wasn’t specifically prompted.

bugglebeetle · 25 days ago

To be fair, it could’ve been post-trained into the model as well…

bugglebeetle commented on Ilya Sutskever: We're moving from the age of scaling to the age of research dwarkesh.com/p/ilya-sutsk... · Posted by u/piotrgrabowski

just-the-wrk · 25 days ago

I think it's important to include that Lex is a laundromat for whatever the guest is trying to sell. Dwarkesh does an impressive amount of background research and speaks with experts about their expertise.

bugglebeetle · 25 days ago

His recent conversation with Sutton suggests otherwise. Friedman is a vapid charlatan par excellence. Dwarkesh suffers from a different problem, where, by rubbing shoulders with experts, he has come to the mistaken belief that he possesses expertise, absent the humility and actual work that would entail.

bugglebeetle commented on Building more with GPT-5.1-Codex-Max openai.com/index/gpt-5-1-... · Posted by u/hansonw

stavros · a month ago

No it won't, it'll spend ten minutes and come back with "OK I've implemented a solution". I really wish it had a plan mode.

bugglebeetle · a month ago

Mileage may vary, but I do the above all day long without issue.