Readit News
punnerud commented on Gemma 3 270M re-implemented in pure PyTorch for local tinkering   github.com/rasbt/LLMs-fro... · Posted by u/ModelForge
lsb · 8 days ago
That’s wild that with a KV cache and compilation on the Mac CPU you are faster than on an A100 GPU.
punnerud · 8 days ago
Is it because on the Mac the CPU and GPU share memory, while the A100 needs to transfer data to RAM/CPU for the parts that aren't supported by the GPU?

(My first guess)

punnerud commented on Gemma 3 270M: Compact model for hyper-efficient AI   developers.googleblog.com... · Posted by u/meetpateltech
canyon289 · 13 days ago
A free Colab. Here's a link; you can fine-tune the model in ~5 minutes in this example, and I encourage you to try your own.

https://ai.google.dev/gemma/docs/core/huggingface_text_full_...

punnerud · 13 days ago
Finally a Google guide using PyTorch and not TensorFlow; that alone made me want to try it out ;)
punnerud commented on GPT-5   openai.com/gpt-5/... · Posted by u/rd
punnerud · 21 days ago
I wish this version had been called OpenAI-GPT-25.8
punnerud commented on Map Comparison Kit (2005)   mck.riks.nl/downloads... · Posted by u/punnerud
punnerud · 21 days ago
It's not often I come across webpages that are frozen in time
punnerud commented on Litestar is worth a look   b-list.org/weblog/2025/au... · Posted by u/todsacerdoti
punnerud · 21 days ago
Good to see it using port 8000 as the default, and not Flask's 5000 (which does not work on Mac anymore)

u/punnerud

Karma: 9376 · Cake day: May 18, 2013
About
Data Scientist at Systek, a small company in Oslo/Norway.

morten.punnerud(at)systek.no

morten(at)punnerud.no

Linkedin: punnerud

[ my public key: https://keybase.io/punnerud; my proof: https://keybase.io/punnerud/sigs/efaOp0ejAsD9V_3PTIBgws4F9nDOgRb_-n6ArrRr0Xg ]
