Readit News
rasbt commented on ReMamba: Equip Mamba with Effective Long-Sequence Modeling   arxiv.org/abs/2408.15496... · Posted by u/PaulHoule
rasbt · a year ago
Thanks for sharing!
rasbt commented on Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer)   youtube.com/watch?v=kPGTx... · Posted by u/rasbt
objektif · a year ago
When the topic under discussion is so incredibly complex that even researchers at the mentioned companies do not understand it. This is like saying let's learn how combustion inside airplane engines works to get a better understanding of what LLMs can do.

Is it not better to focus your limited time on things that you can understand?

rasbt · a year ago
I disagree here: Setting up a large-scale pretraining run is super complex if you have to manage your distributed computing platform, but looking at what the training data looks like and how it is fed into an LLM is not that complex. If you are developing a product based on or with LLMs, it's worth spending a few hours to understand it at the big-picture level. I mean, look at how many people are confused about why LLMs a) hallucinate facts, b) sometimes copy text passages verbatim, and c) probably shouldn't be used as scientific calculators. All of that could be much clearer if you know how they are trained.
rasbt commented on Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer)   youtube.com/watch?v=kPGTx... · Posted by u/rasbt
pcloadletter_ · a year ago
I find it can be nice to have an academic understanding of things you work with even if you don't have to develop it directly yourself.
rasbt · a year ago
Agreed, understanding how a method works and how it would be done helps with developing an intuition for its limitations -- what it can and what it can't do.
rasbt commented on Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer)   youtube.com/watch?v=kPGTx... · Posted by u/rasbt
htrp · a year ago
Not Sebastian (who I assume is the OP), but his blog/Substack is also a great resource:

https://magazine.sebastianraschka.com/

rasbt · a year ago
Thanks for mentioning it, that makes me super happy to hear!

u/rasbt

Karma: 1717 · Cake day: June 6, 2014
About
AI researcher and statistics professor