Readit News logoReadit News
CaveTech commented on Why can't transformers learn multiplication?   arxiv.org/abs/2510.00184... · Posted by u/PaulHoule
hodgehog11 · 2 months ago
> It clearly has very finite bounds by the fact we haven’t achieved agi by running an llm in a loop..

That's one hell of a criterion. Test-time inference undergoes a similar scaling law to pretraining, and has resulted in dramatically improved performance on many complex tasks. Law of diminishing returns kicks in of course, but this doesn't mean it's ineffective.

> akin to taking a few more stabs at RNG

Assuming I understand you correctly, I disagree. Scaling laws cannot appear with glassy optimisation procedures (essentially iid trials until you succeed, the mental model you seem to be implying here). They only appear if the underlying optimisation is globally connected and roughly convex. It's no different than gradient descent in this regard.

CaveTech · 2 months ago
I never made a claim that it's ineffective, just that it's of limited effectiveness. The diminishing returns kick in quickly, and it's not applicable in more domains than it is applicable.
CaveTech commented on Why can't transformers learn multiplication?   arxiv.org/abs/2510.00184... · Posted by u/PaulHoule
hodgehog11 · 2 months ago
I'm a bit confused by this; are you referring to vanishing/exploding gradients during training or iteration at inference? If the former, this is only true if you take too many steps. If the latter, we already know this works and scales well.
CaveTech · 2 months ago
The latter, and I would disagree that “this works and scales well” in the general sense. It clearly has very finite bounds by the fact we haven’t achieved agi by running an llm in a loop..

The approach of “try a few more things before stopping” is a great strategy akin to taking a few more stabs at RNG. It’s not the same as saying keep trying until you get there - you won’t.

CaveTech commented on Why can't transformers learn multiplication?   arxiv.org/abs/2510.00184... · Posted by u/PaulHoule
ddingus · 2 months ago
A while back I saw a post where people ran a model over and over to accomplish a code base port from one language to another.

In their prompt, they told it to leave itself a note and to accomplish something each time.

Then they put the model in a loop and it worked. In one instance, a model removed itself from the loop by editing a file or some other basic means.

To me, iterative tasks like like multiply and long divide, look an awful lot like the code port experiment.

Putting models into loops so they get more than one bite at the task seems to be a logical progression to improve capability.

CaveTech · 2 months ago
The amount of paths in the wrong direction are infinitely more than then number in the right direction. You'll quickly realize this doesn't actually scale.
CaveTech commented on Show HN: Autism Simulator   autism-simulator.vercel.a... · Posted by u/joshcsimmons
devmor · 3 months ago
Skipping breakfast absolutely causes a noticeable drop in energy for most people. Especially if you work an intellectually demanding job. That blood glucose is important.

Also of note given the OP’s subject, several studies show that people with significantly impactful Autism spectrum disorders have higher brain glucose consumption than the baseline.

CaveTech · 3 months ago
[citation needed]
CaveTech commented on Windsurf employee #2: I was given a payout of only 1% what my shares where worth   twitter.com/premqnair/sta... · Posted by u/rfurmani
jongjong · 5 months ago
I suppose the industry is not hot right now. EdTech was never really very hot. It was 'luke warm' at best, a decade ago. They own a lot of software, also, they publish their own math textbook (both digital and print). They have licenses with thousands of schools across multiple countries. I don't recall they have any debt.

I feel like they could easily bump up profits by $2 million just by letting go of people... But they could probably double the license cost per student. Although schools don't have much money, they are kind of slow and bureaucratic; set in their ways. It's a small cost for them anyway, once a system is part of the curriculum, they'll probably pay extra to avoid reorganizing the lessons.

CaveTech · 5 months ago
As you describe this is largely a cash flow business and the bulk of the value should be extracted via dividends to the benefit of major shareholders.

A tech enabled business needs gross margins north of 70% to be attractive from a leverage standpoint, unless revenue is scaling very rapidly. Without these there’s no attractive exit opportunities.

CaveTech commented on Windsurf employee #2: I was given a payout of only 1% what my shares where worth   twitter.com/premqnair/sta... · Posted by u/rfurmani
jongjong · 5 months ago
I worked for an ed-tech startup as employee number 4, joined when it was obscure; not even in the Alexa top 4 million rankings and almost no revenue. The founder was really good though and gave everyone shares instead of options. I got a bit under 0.2% equity in the company. The company grew (slowly and steadily) to $6.5 million USD revenue with about 10% net profit margins but its last valuation (over 10 years later) was like $8 million USD. They charge like $15 USD PER YEAR PER student for their product so very cheap; I feel like they could easily increase the prices given how widely used they are in my country (over 30% of students in my country use the app).

I had the option to sell some equity recently but it would have only been like $16K USD so I held... I had about $9K taken out of my salary to pay for those so it doesn't make sense to sell given the massive growth the app and not that much dilution... The financial gain barely covers the inflation.

It feels like both revenue and profits have been kept artificially low. $6.5 million per year revenue, still growing steadily, with a loyal customer base with 10% profit seems really good... A valuation of $8 million seems ridiculously low... Not even 2x revenue, for a tech platform with good lock-in factor (they sell a lot of licenses to schools)!

It's kind of amazing how bad a deal it is to work for someone else as an employee. Even if the founder is good and generous in many ways and the business side (which you have little control over as a developer) happens to work out pretty well, they can still pull all sorts of levers to make the deal bad. With this one, I'm going to wait it out 20 years if I must. A lot of the game is just timing, you gotta wait it out, sell at the top... Some people see a peak opportunity to cash-in multiple times in their lives, some people never see it! In my case, I haven't seen the top yet.

I never had any opportunity to make serious money ever. Never had an opportunity to pull the trigger and make even $100K. The best I ever got was in crypto, my crypto was worth $100K but I was earning like 100% annual yield and required a 1-month unlock period. So I made more than that by holding it for 3 years anyway...

I think my career story so far is quite interesting. Probably more interesting than 99% of the classic SV startup stories (at least what they say publicly). I've done some things nobody else has done. Made money in truly adverse environments where a lot of people hated my guts. I've seen people behave in strange ways. At times, I felt like I was almost breaking through the membrane of 'the matrix'; almost transcending my social class. But all I got for it was 3 years of passive income. I never had the opportunity to cash out big.

It's tough out there, so tough, it often feels fake/artificial. Often, it feels like you have to be 'chosen' and that's all that matters. Your work doesn't matter, how talented you are doesn't matter, how lucky you get doesn't matter (besides the luck of 'being chosen').

At the end of the day, money is like a river and people upstream from you get to decide whether or not the river will flow in your direction. When you understand that new money is created constantly and, just like the river, the water cycles between the mountain and the sea, you start to understand the value of positioning and 'being selected'. The people upstream will keep telling you that they don't control the flow of money; that the river flows naturally through the lowest valleys... It's your job to put yourself in that low valley... But really, they've built massive dams up there directing the water almost arbitrarily. You may be at the lowest valley but they're redirecting the water elsewhere artificially because it suits them better. Reality is that they can easily alter the path of the river anywhere they want and it has little to do with 'building something people want'. It's about building something the people upstream want... And sometimes they just want to help their existing friends; unfortunate for you if you are not their friend.

It's a catch-22; you need rich friends to get money but you need money to get rich friends. But I suspect it's way easier for a poor person to get rich by befriending a rich person than it is for a poor person to get rich without rich friends. The second approach feels like you're piercing through 'the matrix' because of all the weird almost conspiratorial resistance you might get (tech feels like one big club).

Sometimes you might accumulate some dirt on some rich people and that gives you some leverage over them but it's the kind of leverage where you have to keep coming back to them to get crumbs. I feel like you can never break through that way due to regulatory capture. You can only do limited damage to them and it's always costly to you. They still have the balance of power.

CaveTech · 5 months ago
Sorry to break it to you but 10% profit on 6.5M rev is very low and will absolutely not fetch a high multiple, especially considering this is a mature 10 year old business. This is not a high growth business and you may have grown overly rose colored glasses by thinking it could be priced as one.
CaveTech commented on Travelers to the U.S. must pay a new $250 'visa integrity fee' – what to know   cnbc.com/2025/07/18/visa-... · Posted by u/koolba
andy_ppp · 5 months ago
I was prepared to be scandalised but it seems fine…
CaveTech · 5 months ago
There are border towns where some workers do daily crossings. Without refund infrastructure in place it looks like this would add $250/day fee for all of these individuals
CaveTech commented on Why email startups fail   forwardemail.net/en/blog/... · Posted by u/skeptrune
nottorp · 6 months ago
It really depends.

If all you do is run games and discord on your home PC memory consumption won't matter.

If you have multiple uses or work from home ... Discord expanding to 4 G to display the meme channel with all those cat photos will be annoying to say the least.

Case in point, I stopped running Discord on my laptop. Still run it on a desktop to keep in touch with some people, but it's not my default goto for any communication.

Also, just because most users don't know better, it doesn't mean that Electron apps aren't basically disrespecting the user's resources and passing needless costs to them. Especially if you have hundreds of million users the extra cost they pay dwarfs whatever you the app developer would have paid for a working native application.

CaveTech · 6 months ago
The whole thesis OP is making is that this isn’t really true, evidenced by real world behaviours. Electron apps have some of the highest market share, even the worlds most popular ide is an electron app.

The amount of people who won’t adopt based on pricipal is exceedingly small.

CaveTech commented on Why email startups fail   forwardemail.net/en/blog/... · Posted by u/skeptrune
ajjenkins · 6 months ago
I’m surprised Hey isn’t mentioned. That’s the only example I know of someone recently trying to reinvent email. Maybe it wasn’t included because it’s part of Basecamp and not its own company. But I think it’s important to discuss if your argument is that “no one has successfully reinvented email”.
CaveTech · 6 months ago
It is mentioned, there's an entire section named "The HEY Experiment".
CaveTech commented on Why email startups fail   forwardemail.net/en/blog/... · Posted by u/skeptrune
egglemonsoup · 6 months ago
I am a real Discord customer who is actively looking for an alternative due to how terrible the performance is on my M1 MacBook and on my gaming PC. I'm just one person—I'm not claiming to represent the 'average' customer. But I am part of the average.
CaveTech · 6 months ago
The other part of the average is the 200m+ monthly active users who can't seem to find the uninstall button.

u/CaveTech

KarmaCake day1655July 27, 2011View Original