Readit News logoReadit News
makestuff commented on Cerebras launches Qwen3-235B, achieving 1.5k tokens per second   cerebras.ai/press-release... · Posted by u/mihau
aurareturn · a month ago
If this is the full fp16 quant, you'd need 2TB of memory to use with the full 131k context.

With 44GB of SRAM per Cerebras chip, you'd need 45 chips chained together. $3m per chip. $135m total to run this.

For comparison, you can buy a DGX B200 with 8x B200 Blackwell chips and 1.4TB of memory for around $500k. Two systems would give you 2.8TB memory which is enough for this. So $1m vs $135m to run this model.

It's not very scalable unless you have some ultra high value task that need super fast inference speed. Maybe hedge funds or some sort of financial markets?

PS. The reason why I think we're only in the beginning of the AI boom is because I can't imagine what we can build if we can run models as good as Claude Opus 4 (or even better) at 1500 tokens/s for a very cheap price and tens of millions of context tokens. We're still a few generations of hardware away I'm guessing.

makestuff · a month ago
I agree there will be some breakthrough (maybe by Nvidia or maybe someone else) that allows these models to run insanely cheap and even locally on a laptop. I could see a hardware company coming out with some sort of specialized card that is just for consumer grade inference for common queries. That way the cloud can be used for sever side inference and training.
makestuff commented on The vibe coder's career path is doomed   blog.florianherrengt.com/... · Posted by u/florianherrengt
phkahler · a month ago
>> There was never career path for a vibe coder. Vibe coder won’t be a thing in a year. There’ll be a new, meaningless term though.

Last year it was "prompt engineer", or was that 2 years ago already. Things move fast on the frontier...

makestuff · a month ago
I completely forgot about prompt engineers. I remember now startups offering insane salaries to them a few years ago. IMO the next iteration will be planning engineers since the new thing seems to be the plan/execution split of these vibe coding tools.
makestuff commented on NYC's office-to-residential conversions could create 17,000 new homes   6sqft.com/nycs-first-wave... · Posted by u/geox
vondur · a month ago
It seems like it'd be pretty expensive to convert office buildings to apartments. I'd imagine there are a ton of building regulations that office buildings don't have to adhere to compared to an apartment.
makestuff · a month ago
I have toured a few converted buildings in NYC. You end up with weird apartment layouts for reasons you mentioned. For example, a lot of the layouts were really long and narrow to satisfy the window requirement since office buildings are generally just giant square floors where you can have desks without direct window access. I think the other issues are running plumbing and electrical to each unit where an office building just needs a central room with plumbing for a common bathroom area.

Overall the economics seem to work in NYC since rents are so expensive, but I would imagine converting an office tower in a MCOL or LCOL city would be harder to make profitable.

makestuff commented on Grok 4 Launch [video]   twitter.com/xai/status/19... · Posted by u/meetpateltech
andreygrehov · a month ago
I just tried Grok 4 and it's insanely good. I was able to generate 1,000 lines of Java CDK code responsible for setting up an EC2 instance with certain pre-installed software. Grok produced all the code in one iteration. 1,000 lines of code, including VPC, Security Groups, etc. Zero syntax errors! Most importantly, it generated userData (#!/bin/bash commands) with accurate `wget` pointing to valid URLs of the latest software artifacts on GitHub. Insane!
makestuff · a month ago
Out of curiosity, why do you use Java instead of typescript for CDK? Just to keep everything in one language?
makestuff commented on Meta invests $14.3B in Scale AI to kick-start superintelligence lab   nytimes.com/2025/06/12/te... · Posted by u/RyanShook
makestuff · 2 months ago
Will Employees be able to gain any liquidity from their options, or is this technically another round with no tender offer?
makestuff commented on I'm Peter Roberts, immigration attorney, who does work for YC and startups. AMA    · Posted by u/proberts
makestuff · 3 months ago
As someone who is a US Citizen, what is something about the immigration process that I probably do not know about, but causes a lot of issues/could be improved.
makestuff commented on Rippling sues Deel over spying   twitter.com/parkerconrad/... · Posted by u/amacneil
swyx · 5 months ago
or lets calm down, this much espionage doesnt actually happen that much, and when it does, separating out people on need-to-know basis and introducing honeypots have been routine parts of the process for decades and costs nothing, no startup to be built here

"security startups that "monitor for corporate espionage"" imply introducing yet another third party that literally has access to all the things (or logs thereof) thereby introducing a nice fat pwn factor for everyone

makestuff · 5 months ago
Oh I agree it is a bad idea, but that doesn't mean it will not happen.
makestuff commented on Rippling sues Deel over spying   twitter.com/parkerconrad/... · Posted by u/amacneil
mattzito · 5 months ago
If you have a few minutes, reading the full complaint is worth it - the blog posts and the articles don't really do the whole story justice.

There is extremely damning evidence that this unnamed individual ("D.S.") in Ireland was acting at the behest of Deel senior leadership, including:

- the COO of deel reached out to a rippling payroll manager on linkedin to recruit them. The rippling employee didn't respond. Shortly thereafter, D.S. pulled up that employees personnel record in the HR system that has their unlisted phone number. Shortly after THAT, the COO of deel reached back out to that employee via WhatsApp and that phone number.

- The information was about to publish a story about Deel potentially violating sanctions. New information in the article was that at least one of the customers involved was a company called "tinybird". No one at rippling was aware that this company even existed, but a week BEFORE the article came out, but after the reporter had been asking questions of Deel, D.S. started searching Slack for "tinybird" (and there were no other searches of "tinybird" across the whole company)

- Around the same time, the reporter for the information reached out to rippling and had internal Rippling slack messages about potential similar sanctions violations. A short time before that happened, D.S. was suddenly searching for "russia", "sanctions", "iran", etc.

- There was an email between D.S. and the ceo of Deel, along with an introduction to someone from the family VC fund.

- And then, of course, the honeypot - a fake channel, fake chats from the Rippling CRO, but the chats had real stories that former Deel employees had alleged. Email sent to only the CEO of Deel, his dad/chairman of the board, and their GC. Just a short time later, D.S. was searching for the fake channel, trying to find it, adn trying to find these chat messages.

I'm sure the CEO will try to have plausible deniability, that it was someone else in his org that he delegated investigating these things to, he had no idea, etc. But if they can get D.S. to crack and share the details of what happened, I think it will be tough to toe that line.

makestuff · 5 months ago
IMO this is going to create a wave of product offerings from security startups that "monitor for corporate espionage" similar to what Meta was doing tracking copy/paste into whats app, but do it across all apps. Like detect for seldom searched keywords, etc.
makestuff commented on AWS Cat Qubits Make Quantum Error Correction Effective, Affordable   nextplatform.com/2025/02/... · Posted by u/rbanffy
makestuff · 6 months ago
Outside of breaking encryption, what class of problems are most likely to be solved if these chips are successful?
makestuff commented on Amazon’s delivery drones are grounded in College Station, Texas   wired.com/story/texas-ama... · Posted by u/impish9208
fnfjfk · 6 months ago
They are working on that, well... kind of. They somehow convinced the NYC government to allow them to legally operate little trucks in bike lanes. They claim that they are "bikes" (bikes have two wheels, that's what "bi" means, these have four)

https://www.reddit.com/r/NYCbike/comments/1gw1wlj/amazon_box...

makestuff · 6 months ago
I'm pretty sure you also do not need a driver's license to operate them since they are bikes.

u/makestuff

KarmaCake day890September 22, 2022View Original