osanseviero (u/osanseviero)

osanseviero commented on Gemma 3n preview: Mobile-first AI developers.googleblog.com... · Posted by u/meetpateltech

jdiff · 10 months ago

There's a 7B mentioned in the chat arena ELO graph, I don't see any other references to it though.

osanseviero · 10 months ago

Hi! The model is 8B if you also load the vision and audio components. We just used the text model in LMArena.

osanseviero commented on Gemma 3 QAT Models: Bringing AI to Consumer GPUs developers.googleblog.com... · Posted by u/emrah

noodletheworld · a year ago

https://huggingface.co/google/gemma-3-27b-it-qat-q4_0-gguf/t...

> 17 days ago

Anywaaay...

I'm literally asking, quite honestly, if this is just an 'after the fact' update literally weeks later, that they uploaded a bunch of models, or if there is something more significant about this I'm missing.

osanseviero · a year ago

Hi! Omar from the Gemma team here.

Last time we only released the quantized GGUFs. Only llama.cpp users could use them (+ Ollama, but without vision).

Now, we released the unquantized checkpoints, so anyone can quantize the model themselves and use it in their favorite tools, including Ollama with vision, MLX, LM Studio, etc. MLX folks also found that the model worked decently at 3 bits compared to naive 3-bit quantization, so by releasing the unquantized checkpoints we allow further experimentation and research.

TL;DR: The first was a release in a specific format/tool. We followed up with a full release of artifacts that enables the community to do much more.
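For readers wondering what "naive" quantization means here, below is a minimal sketch of block-wise round-to-nearest 4-bit quantization in plain Python. It is an illustration only: the function names, the block size of 32, and the symmetric -7..7 code range are assumptions made for this example, and it is neither the QAT procedure used for Gemma 3 nor the exact GGUF on-disk layout.

```python
def quantize_blockwise(weights, block_size=32):
    """Naive symmetric 4-bit round-to-nearest quantization.

    Each block of weights shares one float scale; every weight is stored
    as a small integer code in [-7, 7]. (Hypothetical helper for illustration.)
    """
    blocks = []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        max_abs = max(abs(w) for w in block)
        # Map the largest magnitude in the block to code 7; avoid dividing by zero.
        scale = max_abs / 7.0 if max_abs > 0 else 1.0
        codes = [max(-7, min(7, round(w / scale))) for w in block]
        blocks.append((scale, codes))
    return blocks


def dequantize_blockwise(blocks):
    """Reconstruct approximate float weights from (scale, codes) blocks."""
    return [scale * code for scale, codes in blocks for code in codes]


if __name__ == "__main__":
    weights = [0.02 * i - 0.5 for i in range(50)]
    restored = dequantize_blockwise(quantize_blockwise(weights))
    worst = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"worst-case reconstruction error: {worst:.4f}")
```

The reconstruction error per weight is bounded by half the block's scale, which is why a few large outliers in a block hurt naive schemes, and why having the unquantized checkpoints lets people try smarter ones.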

osanseviero commented on Gemma 3 Technical Report [pdf] storage.googleapis.com/de... · Posted by u/meetpateltech

LeoPanthera · a year ago

Doesn't yet work in LM Studio. Barfs an error when trying to load the model. (Error 6, whatever that means. Happy I missed the first 5.)

osanseviero · a year ago

Please make sure to update to the latest llama.cpp version

osanseviero commented on Plasticlist Report – Data on plastic chemicals in Bay Area foods plasticlist.org/report... · Posted by u/jeff18

sizzle · a year ago

This research by these non academic background folks is simply astounding and exceptional. How did they get the funding to run $500k of independent lab testing? Can we donate to the cause?

This stuff is on my mind all the time eating out or from plastic-impregnated cardboard food packaging lining, etc. I’m worried about reproductive impact on future generations and overall personal health, etc.

osanseviero · a year ago

Nat Friedman leads the project. He was GitHub's CEO, among many other things. He funds many interesting ambitious projects, such as the Vesuvius Challenge (https://scrollprize.org/)

osanseviero commented on BERTs Are Generative In-Context Learners arxiv.org/abs/2406.04823... · Posted by u/fzliu

srameshc · a year ago

As someone who has very limited understanding but tried to use BERT for classification, is BERT still relevant when compared to LLMs? Asking because I hardly see any mention of BERTs anymore.

osanseviero · a year ago

Yes, they are still used:

- Encoder-based models have much faster inference (they are not auto-regressive) and are smaller. They are great for applications where speed and efficiency are key.
- Most embedding models are BERT-based (see the MTEB leaderboard), so they are widely used for retrieval.
- They are also used to filter data for pre-training decoder models. The Llama 3 authors used a quality classifier (DistilRoberta) to generate quality scores for documents. Something similar is done for FineWeb Edu.
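The data-filtering use case can be sketched roughly as below. This is a toy illustration under stated assumptions: `toy_quality_score` is a hypothetical stand-in (fraction of distinct words) for a real encoder-based classifier, and the 0.5 threshold is arbitrary. It is not the pipeline used for Llama 3 or FineWeb Edu.

```python
from typing import Callable, Iterable, List


def toy_quality_score(doc: str) -> float:
    """Stand-in scorer: fraction of distinct words in the document.

    A real pipeline would call an encoder classifier here instead.
    """
    words = doc.lower().split()
    if not words:
        return 0.0
    return len(set(words)) / len(words)


def filter_by_quality(
    docs: Iterable[str],
    score_fn: Callable[[str], float],
    threshold: float = 0.5,
) -> List[str]:
    """Keep only documents whose quality score meets the threshold."""
    return [doc for doc in docs if score_fn(doc) >= threshold]


if __name__ == "__main__":
    corpus = [
        "buy buy buy buy",
        "Encoders read the whole input at once",
        "",
    ]
    for doc in filter_by_quality(corpus, toy_quality_score):
        print(doc)
```

Because the scorer is passed in as a function, swapping the toy heuristic for an actual classifier only changes `score_fn`; one forward pass per document is what makes encoder models cheap enough for this kind of large-scale filtering.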

Posted by u/osanseviero a year ago

IBM and NASA Release OS Model for Weather and Climate Applications newsroom.ibm.com/2024-09-...

osanseviero commented on Open source AI is the path forward about.fb.com/news/2024/07... · Posted by u/atgctg

patrickaljord · 2 years ago

Is there an LLM with actual open source training code and dataset? Besides BLOOM https://huggingface.co/bigscience/bloom

osanseviero · 2 years ago

Yes, there are a few dozen fully open source models (license, code, data, models).

osanseviero commented on DevRel at HuggingFace dx.tips/huggingface... · Posted by u/swyx

osanseviero · 2 years ago

Hi all! I'm Omar from Hugging Face. Happy to answer any questions you might have about Hugging Face in general, llamas, and open ML!

Posted by u/osanseviero 2 years ago

State of Open AI – July Edition docs.google.com/presentat...

Posted by u/osanseviero 2 years ago

Brew Install Llama.cpp twitter.com/ggerganov/sta...

u/osanseviero

Karma: 532 · Cake day: February 2, 2015