msoad (u/msoad) - Readit News

msoad commented on Nvidia’s $589B DeepSeek rout finance.yahoo.com/news/as... · Posted by u/rcarmo

qqtt · 7 months ago

Well you have to keep in mind that Nvidia has a 3 trillion dollar valuation. That kind of heavy valuation comes with heavy expectations about future growth. Some of those assumptions about future Nvidia growth are their ability to maintain their heavy growth rates, for very far into the future.

Training is a huge component of Nvidia's projected growth. Inference is actually much more competitive, but training is almost exclusively Nvidia's domain. If Deepseek's claims are true, that would represent a 10x reduction in cost for training for similar models (6 million for r1 vs 60 million for something like o1).

It is absolutely not the case in ML that "there is nothing bad about more resources". There is something very bad - cost. And another bad thing - depreciation. And finally, another bad thing - the fact that new chips and approaches are coming out all the time, so if you are on older hardware you might be missing out. Training complex models for cheaper will allow companies to potentially re-allocate away from hardware into software (ie, hiring more engineering to build more models, instead of less engineers and more hardware to build less models).

Finally, there is a giant elephant in the room that it is very unclear if throwing more resources at LLM training will net better results. There are diminishing returns in terms of return on investment in training, especially with LLM-style use cases. It is actually very non-obvious right now how pouring more compute specifically at training will result in better LLMs.

msoad · 7 months ago

My layman view is that more compute (more reasoning) will not solve harder problems. I'm using those models every day and when problem hits a certain complexity it will fail, no matter how much it "reasons"

msoad commented on DeepSeek releases Janus Pro, a text-to-image generator [pdf] github.com/deepseek-ai/Ja... · Posted by u/reissbaker

blitzar · 7 months ago

We get free Ai from a hedge fund and $200/month Ai from a nonprofit.

msoad · 7 months ago

I hope the hedge fund shorted NVDA to make some good money along the way too hahaha!

msoad commented on DeepSeek releases Janus Pro, a text-to-image generator [pdf] github.com/deepseek-ai/Ja... · Posted by u/reissbaker

erulabs · 7 months ago

One thing I'd love to hear opinions on from someone with more free time to read these papers from DeepSeek is: am I right to feel like they're... publishing all their secret sauce? The paper for R1 (1) seems to be pretty clear how they got such good results with so little horsepower (see: 'Group Relative Policy Optimization'). Is it not likely that Facebook, OpenAI, etc will just read these papers and implement the tricks? Am I missing something?

1. https://arxiv.org/abs/2501.12948

msoad · 7 months ago

They could make a ton of money shorting NVDA and releasing the paper. The most honest short position ever!

msoad commented on Nvidia’s $589B DeepSeek rout finance.yahoo.com/news/as... · Posted by u/rcarmo

msoad · 7 months ago

We are at multiple trillion dollar investment territory, all purely based on the idea that "to make AI you need lots of GPU and power"

msoad commented on Nvidia’s $589B DeepSeek rout finance.yahoo.com/news/as... · Posted by u/rcarmo

belter · 7 months ago

They have a lot of H100: https://www.reddit.com/r/NVDA_Stock/comments/1iadc0s/evidenc...

msoad · 7 months ago

four GPUs are very convincing indeed! :D

msoad commented on Qwen2.5-1M: Deploy your own Qwen with context length up to 1M tokens qwenlm.github.io/blog/qwe... · Posted by u/meetpateltech

dr_kiszonka · 7 months ago

How do you get those large codebases into AI Studio? Concat everything into one big file?

msoad · 7 months ago

I use yek

https://github.com/bodo-run/yek

msoad commented on Show HN: Lightpanda, an open-source headless browser in Zig github.com/lightpanda-io/... · Posted by u/fbouvier

nwienert · 7 months ago

Playwright can run webkit very easily and it's dramatically less resource-intensive than Chrome.

msoad · 7 months ago

Does it work nicely on Linux? I'm very curious about this

msoad commented on Show HN: Trolling SMS spammers with Ollama evan.widloski.com/softwar... · Posted by u/Evidlo

msoad · 7 months ago

All I want is an iPhone Shortcuts script to delete messages like "Hi" and "Hey" from unknown numbers. I get so many of those and having to delete them is a pain.

Shortcuts does not allow deleting messages apparently :(