Readit News
scribu commented on Claude Opus 4 and 4.1 can now end a rare subset of conversations   anthropic.com/research/en... · Posted by u/virgildotcodes
ogyousef · 9 days ago
3 years in and we still don't have a usable chat fork in any of the major LLM chatbot providers.

Seems like the only way to explore different outcomes is by editing messages and losing whatever was there before the edit.

Very annoying, and I don't understand why they all refuse to implement such a simple feature.

scribu · 9 days ago
ChatGPT Plus has that (used to be in the free tier too). You can toggle between versions for each of your messages with little left-right arrows.
scribu commented on Achieving 10,000x training data reduction with high-fidelity labels   research.google/blog/achi... · Posted by u/badmonster
scribu · 16 days ago
I’m confused by the clustering step:

> To find the most informative examples, we separately cluster examples labeled clickbait and examples labeled benign, which yields some overlapping clusters

How can you get overlapping clusters if the two sets of labelled examples are disjoint?
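
One possible reading (an assumption, not something the quoted post confirms): "overlapping" refers to regions of embedding space, not to shared members. A minimal sketch with made-up 2-D points and a hypothetical centroid-plus-radius overlap test shows two disjoint sets whose clusters still overlap spatially:

```python
import math

def centroid(points):
    """Mean position of a list of 2-D points."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)

def radius(points, center):
    """Distance from the center to the farthest point in the cluster."""
    return max(math.dist(p, center) for p in points)

def clusters_overlap(a, b):
    """True if the bounding circles of the two clusters intersect."""
    ca, cb = centroid(a), centroid(b)
    return math.dist(ca, cb) < radius(a, ca) + radius(b, cb)

# Hypothetical embeddings; the two labelled sets share no examples.
clickbait_cluster = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
benign_cluster = [(0.5, 0.5), (1.5, 0.5), (0.5, 1.5), (1.5, 1.5)]
far_benign_cluster = [(10.0, 10.0), (11.0, 10.0), (10.0, 11.0), (11.0, 11.0)]

# Disjoint membership, yet the first pair occupies the same region.
assert set(clickbait_cluster).isdisjoint(benign_cluster)
print(clusters_overlap(clickbait_cluster, benign_cluster))
print(clusters_overlap(clickbait_cluster, far_benign_cluster))
```

Under that reading, an overlapping pair would mark a region where differently labelled examples sit close together.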

scribu commented on Vision Now Available in Llama.cpp   github.com/ggml-org/llama... · Posted by u/redman25
simonw · 3 months ago
This is the most useful documentation I've found so far to help understand how this works: https://github.com/ggml-org/llama.cpp/tree/master/tools/mtmd...
scribu · 3 months ago
It’s interesting that they decided to move all of the architecture-specific image-to-embedding preprocessing into a separate library.

Similar to how we ended up with the huggingface/tokenizers library for text-only Transformers.

Deleted Comment

scribu commented on LLMs can see and hear without any training   github.com/facebookresear... · Posted by u/T-A
scribu · 4 months ago
This seems to be a system to generate better prompts to be fed into a base multimodal model.

Interesting, but the title is definitely clickbait.

scribu commented on Sapphire: Rust based package manager for macOS   github.com/alexykn/sapphi... · Posted by u/adamnemecek
watermelon0 · 4 months ago
IIRC the main reason here is that brew path is hardcoded during the build process of packages, which means that you wouldn't be able to use bottles.

I didn't check, but there is a chance that path is also hardcoded in (some) formulae, so even building from the source might not help here.

scribu · 4 months ago
You could run the build process with chroot or inside Docker, so that the hardcoded paths actually resolve to a designated subdirectory.
scribu commented on Skywork-OR1: new SOTA 32B thinking model with open weight   github.com/SkyworkAI/Skyw... · Posted by u/naomiclarkson
scribu · 4 months ago
From their Notion page:

> Skywork-OR1-32B-Preview delivers the 671B-parameter Deepseek-R1 performance on math tasks (AIME24 and AIME25) and coding tasks (LiveCodeBench).

Impressive, if true: much better performance than the vanilla distills of R1.

Plus it’s a fully open-source release (including data selection and training code).

u/scribu

Karma: 4035 · Cake day: April 18, 2009
About
Machine Learning Engineer, living in London

Email: cristi@burca.ro
