Readit News logoReadit News
Zvez commented on Building the heap: racking 30 petabytes of hard drives for pretraining   si.inc/posts/the-heap/... · Posted by u/nee1r
Zvez · 3 months ago
That's basically scaled up story of 'I store my files on my computer and it is 10x cheaper than using dropbox'

While disks fail rate is already explored in another threads here, there is one related thing that catch my interest. Disk failure in such setup is not just cost of new disk + replacement cost (someone has to go there and change it!). It also inconvenience with dealing with failing requests. Ok, you are willing to lose 5% of your dataset. But are your '200-lines of code' robust enough to handle such cases. What if disk didn't fail, but start to be veeeeery slow. Does your training process can efficiently skip such bad objects. Do you have enough transparency to understand how much data you already lost? Is it still below 5%? And so on and so forth.

I feel like this article was written right after they built this construction and before let say 6 months of usage. Because I'm pretty sure their costs will go much higher than they calculated here. Especially if they start including hidden costs, like the work needed to be done on training side.

Yes, cost for self-hosting most probably still be less than aws (aws is not cheap). But it might start to be comparable with storage solutions of small ('neo') cloud providers if you buy gpu there.

Zvez commented on Colossus for Rapid Storage   cloud.google.com/blog/pro... · Posted by u/alobrah
CobrastanJorji · 9 months ago
They're not saying that it's AI. They're saying it's for customers who do AI. Training means lots and lots of reads from a big data store, and if you're reading from, like, big Parquet files, that probably means lots of random reads. This is for that. Speedier data access, presumably at the cost of durability and availability, which is probably a great trade-off for people doing ML training jobs.
Zvez · 9 months ago
calling everything 'for AI' is the new standard

>if you're reading from, like, big Parquet files, that probably means lots of random reads

and it also usually means that you shouldn't use s3 in the first place for workloads like this. Because they are usually very inefficient comparing to distributed fs. Unless you have some prefetch/cache layer, you will get both bad timings and higher costs

Zvez commented on Helldivers 2 Removed from Purchase on Steam in over 150 Countries   thegamer.com/helldivers-2... · Posted by u/marijnz
haunter · 2 years ago
>entirely ignores that a PSN account was required from the beginning but waived because of the unexpected popularity of the game

They shouldn't have sold the game on Steam in countries where PSN is not available.

Zvez · 2 years ago
>They shouldn't have sold the games on Steam in countries where PSN is not available

yep, as simple as that this would have made the situation much better. Game would have gotten lower scores and player base overall. Doing it now and like this leaves the feeling, that they got enough money from sales and now they want to bring some traffic to psn

Zvez commented on Skip the API, ship your database   fly.io/blog/skip-the-api/... · Posted by u/danielskogly
Zvez · 2 years ago
If you give access to your DB directly, your API effectively becomes your API with all the contract obligations of the API. Suddenly you don't completely control your schema: you can't freely change it, you need to add things there for your clients only. I've seen it done multiple times and it always end up poorly. You save some time now by removing the need to build API, but later you end up spending much more time trying to decouple your internal representation from schema you made public.
Zvez commented on Ask HN: What's the best self hosted/local alternative to GPT-4?    · Posted by u/surrTurr
inductive_magic · 3 years ago
OpenAI builds an LLM and an api-interface to that model.

The design of abstractions, prompt engineering, custom fine-tunes and software engineering required to ship a valuable application on top of that interface counts as "building an app" in my book.

Zvez · 3 years ago
while this is technically correct, difference is that your product won't survive without openai at this point. If you need the model quality openai provides you are stuck and your product can just disappear. Because llm is core building block, irreplaceable one.
Zvez commented on Legend of Zelda game sells 10M copies in three days   finance.yahoo.com/news/le... · Posted by u/he0001
sanitycheck · 3 years ago
Nintendo didn't really need to do any marketing beyond "Hey we are releasing a new Zelda game, it will run on the Switch you bought years ago mostly in order to play the previous Zelda game".

I've been playing games for 30+ years, at no point have I cared deeply about the difference between 30fps and 60fps. I do not like 5fps-stuttering, which happens sometimes in poorly optimised AAA games no matter how good (within reason) the GPU.

The new Zelda runs well and looks good. I like it, though it's very much "BOTW Continued" which maybe we didn't need, I might have preferred a completely fresh world & characters instead of the same Hyrule a few years later.

Zvez · 3 years ago
>Nintendo didn't really need to do any marketing

still they did.

I'm in Warsaw and I see countless of zelda ads: whole walls of huge buildings are covered with them. They probably spent insane amount of money if they advertise it like this everywhere

Zvez commented on CS 61B Data Structures, Spring 2023 UC Berkeley   sp23.datastructur.es/... · Posted by u/curious16
noobdev9000 · 3 years ago
Is there a point for working programmer to learn these in 2023, rather than just treating them as abstract interfaces?
Zvez · 3 years ago
I guess it depends on what you do and your goals. It might be not necessary to do average developers job (and full disclosure, it wasn't necessary 10 years ago as well). But understanding fundamentals gives you insights to be better prepared to choose right 'interface' when you need to. Also you can see it as a way to stretch your 'programmer muscles'.

After going through lecture topics list, I think most of those you actually need to know as a working programmer. Not because they are prerequisites, but because after couple of years in the field, you will have to touch most of those topics anyway.

Zvez commented on Hetzner Introduces ARM64 Cloud Servers   hetzner.com/press-release... · Posted by u/j4nek
martinald · 3 years ago
Agreed entirely for most. I have seen so many insanely complicated and eye wateringly expensive architectures for what are simple web apps on the big clouds. Often the team don't really understand the intricacies of all these services and how they interrelate which causes often cause a lot of stability problems.

Even if they don't, there is usually a huge tax on developer productivity with these architectures, which often outweighs any improved stability over the long term.

Let's keep in mind that modern hardware and software is very stable, generally.

Zvez · 3 years ago
>Let's keep in mind that modern hardware and software is very stable, generally

Not at any significant scale. dimm will fail, power will be down, disks will need replacement.

It is all about risks after all. If you are ok to have couple of hours of downtime if one of the memory sticks stops working - good for you. But generally any large enough business won't tolerate such risks.

I'm not saying that the cloud is the answer, but I don't see any future for single instance solution. And if you design your system like this, you are taking much more risks than necessary.

Zvez commented on I lost everything that made me love my job through Midjourney   old.reddit.com/r/blender/... · Posted by u/Fraterkes
vasco · 3 years ago
Do you think it takes as many people to build whatever software you want today vs in the 80s? I agree more software can be built with the improvements, but per unit of functionality, way less people are needed, or they can do it much faster, which in both cases mean you spend less of your revenue on labor, which means "less jobs".
Zvez · 3 years ago
their point is simple: during the last 20-30 years we had efficiency improvements by orders of magnitude. Still we didn't have developers jobs reduction in any way. We evolved into doing more and more high-level and high-scale jobs. So no, -10% of time needed to build the same unit of functionality won't lead to -10% reduction of workforce. It will change the scope of said workforce.
Zvez commented on “Clean” code, horrible performance   computerenhance.com/p/cle... · Posted by u/eapriv
whstl · 3 years ago
Not necessarily in the general case. If the program is a single user, locally ran program, then sure. If it's some sort of backend or distributed service, this is just wasted performance that can be used to serve more users. Virtually every distributed service built today is able to take advantage of this.

With a non-pessimal design, not only you are able to pay less money on servers in the long term, you're also able to delay complex scaling strategies. Scaling also costs money.

Not to mention that building something with this kind of overhead also means that a lot developer time was spent in the first place, which is still expensive in our industry.

Zvez · 3 years ago
using 'unclean' code practice will increase development costs. And more importantly - maintainability of such code.

>Virtually every distributed service built today is able to take advantage of this

most of built today services can take much more advantage in using better system design practices.

u/Zvez

KarmaCake day244August 17, 2010View Original