A uniquely efficient hardware stack, for either training or inference, would be a great moat in an industry that seems to offer few moats.
I keep waiting to hear of more adoption of Cerebras Systems' wafer-scale chips. They may be held back by not offering the full hardware stack, i.e. their own data centers optimized around wafer-scale compute units. (They do partner with AWS as a third-party provider, in competition with AWS's own silicon.)
In that sense, estimation should theoretically become a more reasonable endeavor. Or maybe not — perhaps we just end up back where we are now, because the LLM has produced unusable code or an impossible-to-find bug that delays shipment, etc.