Readit News logoReadit News
fzimmermann89 commented on Ask HN: Best foundation model for CLM fine-tuning?    · Posted by u/philomath868
fzimmermann89 · 5 days ago
How foreign is the language - was it likely included in pre training to some degree? Does it use grammar, syllables, and logic similiar to one of the large languages? Your approach assumes there is an easy to learn mapping between context in your target language and concepts in a prettained llm.

Can you get more text written in the low resources language?

Are you ok to share the name of the language?

fzimmermann89 · 5 days ago
Also, for an auto complete I think a small llm trained from scratch should already work well. Have you tried on if the tinystories(also only 3gb..)/nanogpt speed runs without any fancy loss terms etc as a baseline?
fzimmermann89 commented on Ask HN: Best foundation model for CLM fine-tuning?    · Posted by u/philomath868
fzimmermann89 · 5 days ago
How foreign is the language - was it likely included in pre training to some degree? Does it use grammar, syllables, and logic similiar to one of the large languages? Your approach assumes there is an easy to learn mapping between context in your target language and concepts in a prettained llm.

Can you get more text written in the low resources language?

Are you ok to share the name of the language?

fzimmermann89 commented on Apple bans entire dev account, no reason given   twitter.com/rameerez/stat... · Posted by u/eecc
runjake · 2 months ago
3.2f.

“You will not, directly or indirectly, commit any act intended to interfere with any of the Apple Software or Services“

fzimmermann89 · 2 months ago
Contacting support obviously interfered with Apple services. Duh.
fzimmermann89 commented on Detecting edges of images at the speed of light   phys.org/news/2025-01-edg... · Posted by u/bookofjoe
fzimmermann89 · 7 months ago
If I am not mistaken, this is done by modulation in Fourier space. We have already been using this in optical setups for ages - at the speed of light.

The interesting part imo is the implementation of this idea in their work and the efficiency and physical size.

fzimmermann89 commented on TwoSet Violin 'ends chapter' after eleven years   thestrad.com/news/twoset-... · Posted by u/botto
fzimmermann89 · a year ago
Sad to hear that they removed most of their content as well.
fzimmermann89 commented on What's New in the Windows Subsystem for Linux in May 2024   devblogs.microsoft.com/co... · Posted by u/ulrischa
fzimmermann89 · a year ago
If only they would fix the memory leak and freeze on resume from hibernate that has been an issue for the last year at least...
fzimmermann89 commented on Windows 11 is amazing, I left Linux   old.reddit.com/r/Windows1... · Posted by u/quyleanh
fzimmermann89 · 2 years ago
I thought the same, until I noticed a really annoying WSL2 bug: On two machines I own, waking up from hibernate or standby causes a wsl related process (vmmem) to consume 100% CPU, with wsl becoming completely unresponsive (including wsl terminate etc).

You have to kill all wsl processes, which requires admin rights. So without elevated rights, Ubuntu on windows is not usable on these laptops.

The issue is known for years and has hundreds of comments on GitHub without a fix (https://github.com/microsoft/WSL/issues/6982)

fzimmermann89 commented on Netflix loses 1M users in Spain over password policing   bloomberg.com/news/articl... · Posted by u/FabHK
johnmaguire · 2 years ago
Well, when the shoe fits...
fzimmermann89 · 2 years ago
*shoes.
fzimmermann89 commented on Introducing ChatGPT and Whisper APIs   openai.com/blog/introduci... · Posted by u/minimaxir
monkmartinez · 3 years ago
> Especially with Meta's new Llama models outperforming GPT-3

Do you have access to the models? It is being discussed all over the Discords and most seem to think getting access is not happening unless you are dialed in.

fzimmermann89 · 3 years ago
I got access by providing an academic email adress without mentioning any relevant publications etc.. Took maybe 2-3 days..

u/fzimmermann89

KarmaCake day33February 13, 2019View Original