Readit News logoReadit News
lossolo commented on Why was Apache Kafka created?   bigdata.2minutestreaming.... · Posted by u/enether
enether · 17 hours ago
I would assume storage varies greatly. I know that LinkedIn quoted an average read fanout ratio of 5.5x in Kafka, meaning each byte was read 5.5x times. Assuming that is still true, we ought to divide by 6.5x to get to the daily write amount

That comes out to 87 disks a day. Assuming a 7 day retention period (this is on the high side), it’s not unthinkable to have a 600-1800 disk deployment (accounting for replication copies)

lossolo · 16 hours ago
> That comes out to 87 disks a day. Assuming a 7 day retention period (this is on the high side), it’s not unthinkable to have a 600-1800 disk deployment (accounting for replication copies)

Yep. Whole week can be easily stored in 1-2 racks.

lossolo commented on Libre – An anonymous social experiment without likes, followers, or ads   libreantisocial.com... · Posted by u/rododecba
schrodinger · 2 days ago
Actually, isn't this a bit like TikTok and why it allows low-follower profiles to rise to the top on occasion?
lossolo · 2 days ago
Yes, TikTok uses similar algorithm to push new content, that's why anyone can go viral there.
lossolo commented on Writing Speed-of-Light Flash Attention for 5090 in CUDA C++   gau-nernst.github.io/fa-5... · Posted by u/dsr12
neilmovva · 2 days ago
Not really:

5090: 210 TF / $2k == 105 TF/$k

B200: 2250 TF / $40k == 56 TF/$k

Getting only 2x the FLOPs per dollar probably isn't worth the hassle of having to rack 10x as many GPUs, while having no NVLink.

lossolo · 2 days ago
One of the reasons they removed NVLink from consumer cards (they supported it before). There’s also an issue with power consumption (1xB200 vs 10x5090)
lossolo commented on Io_uring, kTLS and Rust for zero syscall HTTPS server   blog.habets.se/2025/04/io... · Posted by u/guntars
commandersaki · 3 days ago
I'm sceptical of the efficiency gains with sendfile; seems marginal at best, even in the late 90s when it was at the height of popularity.
lossolo · 3 days ago
> seems marginal at best

Depends on the workload.

Normally you would go read() -> write() so:

1. Disk -> page cache (DMA)

2. Kernel -> user copy (read)

3. User -> kernel copy (write)

4. Kernel -> NIC (DMA)

sendfile():

1. Disk -> page cache (DMA)

No user space copies, kernel wires those pages straight to the socket

2. Kernel -> NIC (DMA)

So basically, it eliminates 1-2 memory copies along with the associated cache pollution and memory bandwidth overhead. If you are running high QPS web services where syscall and copy overheads dominate, for example CDNs/static file serving the gains can be really big. Based on my observations this can mean double digit reductions in CPU usage and up to ~2x higher throughput.

lossolo commented on The US Department of Agriculture Bans Support for Renewables   insideclimatenews.org/new... · Posted by u/mooreds
Loughla · 3 days ago
And why? That's the part that gets me. What's the reason? If we can develop renewables that brings the cost of energy down, how is that bad?
lossolo · 3 days ago
It wasn’t demonized for ideological reasons. Whose interest is it for renewables to fail? Follow the money.
lossolo commented on The US Department of Agriculture Bans Support for Renewables   insideclimatenews.org/new... · Posted by u/mooreds
IT4MD · 3 days ago
[flagged]
lossolo · 3 days ago
It’s pretty simple, actually, you have a problem with institutionalized corruption in your country. This was lobbied for and now executed. Drill, baby, drill.
lossolo commented on Io_uring, kTLS and Rust for zero syscall HTTPS server   blog.habets.se/2025/04/io... · Posted by u/guntars
butterisgood · 3 days ago
I just wish people who give this advice for 1 thread per core would "expand their reasoning" or "show the work".

It's not blanket good advice for all things.

lossolo · 3 days ago
Check out Scylla and its underlying framework Seastar. They expand their reasoning and show the work.
lossolo commented on From GPT-4 to GPT-5: Measuring progress through MedHELM [pdf]   fertrevino.com/docs/gpt5_... · Posted by u/fertrevino
hodgehog11 · 3 days ago
True, but the experiments are engineered to give results they want. It's a mathematical certainty that the performance will drop off here, but is not an accurate assessment of what is going on at scale. If you present an appropriately large and well-trained model with in-context patterns, it often does a decent job, even when it isn't trained on them. By nerfing the model (4 layers), the conclusion is foregone.

I honestly wish this paper actually showed what it claims, since it is a significant open problem to understand CoT reasoning relative to the underlying training set.

lossolo · 3 days ago
Without a provable hold out, claim that "large models do fine on unseen patterns" is unfalsifiable. In controlled from scratch training, CoT performance collapses under modest distribution shift, even with plausible chains. If you have results where the transformation family is provably excluded from training and a large model still shows robust CoT, please share them. Otherwise this paper’s claim stands for the regime it tests.
lossolo commented on Say farewell to the AI bubble, and get ready for the crash   latimes.com/business/stor... · Posted by u/taimurkazmi
NitpickLawyer · 5 days ago
> No, it wasn’t.

I replied below in this thread with the specific post, 6 months ago.

> After that, a top goal for us is to unify o-series models and GPT-series models by creating systems that can use all our tools, know when to think for a long time or not, and generally be useful for a very wide range of tasks.

> In both ChatGPT and our API, we will release GPT-5 as a system that integrates a lot of our technology, including o3. We will no longer ship o3 as a standalone model.

lossolo · 5 days ago
"the delta between 5 and 4 will be the same as between 4 and 3"[1]

Obviously it's not.

1. https://lexfridman.com/sam-altman-2-transcript/

lossolo commented on Analysis of the GFW's Unconditional Port 443 Block on August 20, 2025   gfw.report/blog/gfw_uncon... · Posted by u/kotri
chickenzzzzu · 5 days ago
Think of how many people who have remote jobs with American companies couldn't connect to their meetings while they "work from home" while secretly being in China!

Normally they have to fight VPN issues anyway, but having a sovereign state inject your packets is certainly a fun new one.

lossolo · 5 days ago
> Normally they have to fight VPN issues anyway

There are special virtual SIM cards that provide access to services from mainland China, as well as VPNs that function normally without issues. I used both while I was in China.

u/lossolo

KarmaCake day2875January 6, 2016View Original