Readit News logoReadit News
osanseviero commented on Gemma 3n preview: Mobile-first AI   developers.googleblog.com... · Posted by u/meetpateltech
jdiff · 10 months ago
There's a 7B mentioned in the chat arena ELO graph, I don't see any other references to it though.
osanseviero · 10 months ago
Hi! The model is 8B if you also load the vision and audio components. We just used the text model in LMArena.
osanseviero commented on Gemma 3 QAT Models: Bringing AI to Consumer GPUs   developers.googleblog.com... · Posted by u/emrah
noodletheworld · a year ago
https://huggingface.co/google/gemma-3-27b-it-qat-q4_0-gguf/t...

> 17 days ago

Anywaaay...

I'm literally asking, quite honestly, if this is just an 'after the fact' update literally weeks later, that they uploaded a bunch of models, or if there is something more significant about this I'm missing.

osanseviero · a year ago
Hi! Omar from the Gemma team here.

Last time we only released the quantized GGUFs. Only llama.cpp users could use it (+ Ollama, but without vision).

Now, we released the unquantized checkpoints, so anyone can quantize themselves and use in their favorite tools, including Ollama with vision, MLX, LM Studio, etc. MLX folks also found that the model worked decently with 3 bits compared to naive 3-bit, so by releasing the unquantized checkpoints we allow further experimentation and research.

TL;DR. One was a release in a specific format/tool, we followed-up with a full release of artifacts that enable the community to do much more.

osanseviero commented on Gemma 3 Technical Report [pdf]   storage.googleapis.com/de... · Posted by u/meetpateltech
LeoPanthera · a year ago
Doesn't yet work in LM Studio. Barfs an error when trying to load the model. (Error 6, whatever that means. Happy I missed the first 5.)
osanseviero · a year ago
Please make sure to update to the latest llama.cpp version
osanseviero commented on Plasticlist Report – Data on plastic chemicals in Bay Area foods   plasticlist.org/report... · Posted by u/jeff18
sizzle · a year ago
This research by these non academic background folks is simply astounding and exceptional. How did they get the funding to run $500k of independent lab testing? Can we donate to the cause?

This stuff is on my mind all the time eating out or from plastic-impregnated cardboard food packaging lining, etc. I’m worried about reproductive impact on future generations and overall personal health, etc.

osanseviero · a year ago
Nat Friedman leads the project. He was GitHub's CEO, among many other things. He funds many interesting ambitious projects, such as the Vesuvius Challenge (https://scrollprize.org/)
osanseviero commented on BERTs Are Generative In-Context Learners   arxiv.org/abs/2406.04823... · Posted by u/fzliu
srameshc · a year ago
As someone who has very limited understanding but tried to use BERT for classification, is BERT still relavant when compared to LLMs ? Asking because I hardly see any mention of BERTs anymore.
osanseviero · a year ago
Yes, they are still used

- Encoder based models have much faster inference (are auto-regressive) and are smaller. They are great for applications where speed and efficiency are key. - Most embedding models are BERT-based (see MTEB leaderboard). So widely used for retrieval. - They are also used to filter data for pre-training decoder models. The Llama 3 authors used a quality classifier (DistilRoberta) to generate quality scores for documents. Something similar is done for FineWeb Edu

osanseviero commented on Open source AI is the path forward   about.fb.com/news/2024/07... · Posted by u/atgctg
patrickaljord · 2 years ago
Is there an LLM with actual open source training code and dataset? Besides BLOOM https://huggingface.co/bigscience/bloom
osanseviero · 2 years ago
Yes, there are a few dozen full open source models (license, code, data, models)
osanseviero commented on DevRel at HuggingFace   dx.tips/huggingface... · Posted by u/swyx
osanseviero · 2 years ago
Hi all! I'm Omar from Hugging Face. Happy to answer any questions you might have about Hugging Face in general, llamas, and open ML!

u/osanseviero

KarmaCake day532February 2, 2015View Original