Readit News logoReadit News
xfalcox commented on Embedding Text Documents with Qwen3   daft.ai/blog/embedding-mi... · Posted by u/kiyanwang
ipsum2 · 15 hours ago
What does it mean to generate 1000 float16 array size on a 32k context? Surely the embedding you get is no longer representative of the text.
xfalcox · 14 hours ago
Depends on your needs. You surely don't want 32k long chunks for doing the standard RAG pipeline, that's for sure.

My use case is basically a recommendation engine, where retrieve a list of similar forum topics based on the current read one. As with dynamic user generated content, it can vary from 10 to 100k tokens. Ideally I would generate embeddings from an LLM generated summary, but that would increase inference costs considerably at the scale I'm applying it.

Having a larger possible context out of the box just made a simple swap of embeddeding models increase quality of recommendations greatly.

xfalcox commented on Embedding Text Documents with Qwen3   daft.ai/blog/embedding-mi... · Posted by u/kiyanwang
xfalcox · 17 hours ago
Just migrated all embeddings to this same model a few weeks ago in my company, and it's a game changer. Having 32k context is a 64x increase when compared with our previous used model. Plus being natively multilingual and producing very standard 1024 long arrays made it a seamless transition even with millions of embeddings across thousands of databases.

I do recommend using https://github.com/huggingface/text-embeddings-inference for fast inference.

xfalcox commented on Rerank-2.5 and rerank-2.5-lite: instruction-following rerankers   blog.voyageai.com/2025/08... · Posted by u/fzliu
sroussey · 10 days ago
Not really sure why they have a HuggingFace presence.

https://huggingface.co/voyageai/rerank-2.5-lite

xfalcox · 10 days ago
Having a public tokenizer is quite useful, specially for embeddings. It allows you to do the chunking locally without going to the internet.
xfalcox commented on GPT-OSS vs. Qwen3 and a detailed look how things evolved since GPT-2   magazine.sebastianraschka... · Posted by u/ModelForge
cranberryturkey · 13 days ago
qwen3 is slow though. i used it. it worked, but it was slow and lacking features.
xfalcox · 13 days ago
Qwen 3 is not slow by any metrics.

Which model, inference software and hardware are you running it on?

The 30BA3B variant flies on any GPU.

xfalcox commented on Workhorse LLMs: Why Open Source Models Dominate Closed Source for Batch Tasks   sutro.sh/blog/workhorse-l... · Posted by u/cmogni1
ramesh31 · 3 months ago
Flash is just so obscenely cheap at this point it's hard to justify the headache of self hosting though. Really only applies to sensitive data IMO.
xfalcox · 3 months ago
You'd be surprised how often people in enterprise can be left waiting months to get an API key approved for an LLM provider.
xfalcox commented on Storefront Web Components   shopify.dev/docs/api/stor... · Posted by u/maltenuhn
blittle · 3 months ago
I'm on the dev team that built this. Happy to answer any questions!

We essentially use web components as a templating language to dynamically generate a GraphQL query to Shopify. Then render the data as text nodes inside the web components. This is powerful because the components don't include shadow roots. So you can come with your own HTML and CSS.

Most web component libraries are opinionated about design, and give you many CSS custom properties or CSS parts to customize. We tried really hard to invert that, and instead give you the design control. Most of our web components just produce a text node, with no shadow root!

There's a few exceptions, like the cart for example, where it's easier to just have an out of the box component that does it all for you `<shopify-cart>`. Though...you can actually build the entire cart component with the lower level primitives!

xfalcox · 3 months ago
This looks like a great fit for allowing people to monetize their Discourse forums, by having partners stores and plugging those instead of ads.

Will build a quick poc integration. How can I contact you with feedback?

xfalcox commented on Show HN: A backend agnostic Ruby framework for building reactive desktop apps   codeberg.org/skinnyjames/... · Posted by u/zero-st4rs
xfalcox · 3 months ago
This looks super cool, exactly what I've been wanting to create some useful widgets! Thanks for sharing!
xfalcox commented on I analyzed chord progressions in 680k songs   cantgetmuchhigher.com/p/i... · Posted by u/jnord
calibas · 4 months ago
From my experience playing guitar, the average punk or metal song is almost entirely power chords, while this data says power chords are only 5% of chords in both genres...

I thought maybe there's types of metal and punk that I don't know about, but Wikipedia, LLMs and guitar tab sites all agree with me. Punk and metal is overwhelmingly power chords, so I don't see how the data comparing chord types can be correct.

xfalcox · 4 months ago
I guess one aspect missing here is weighting more popular songs on that analysis.

I assume that the analysis is simply counting every song chords, so a unknown band you've never heard about has the same impact as The Ramones.

I'd like to see the same graph weighted by band popularity using either YouTube or Spotify data.

xfalcox commented on Cohere Launches Embed 4   cohere.com/blog/embed-4... · Posted by u/rekovacs
xfalcox · 4 months ago
No downloadable open weights ?

Looks like I'll stay on [bge-m3](https://huggingface.co/BAAI/bge-m3)

xfalcox commented on Brazil's government-run payments system has become dominant   economist.com/the-america... · Posted by u/jcartw
marcosdumay · 5 months ago
And there's no surprise fraud claims.
xfalcox · 5 months ago
My wife runs a small retail makeup shop on Shopify, which started before pix and those surprise false fraud claims almost killed the business.

Pix was such a game changer. It is perfect.

u/xfalcox

KarmaCake day743November 7, 2013
About
[ my public key: https://keybase.io/falcofantastic; my proof: https://keybase.io/falcofantastic/sigs/_PKYsKf2wmCyt834lEh6N4POje9RoICd3Ta7qezTzJE ]
View Original