xfalcox (u/xfalcox) - Readit News

xfalcox commented on Embedding Text Documents with Qwen3 daft.ai/blog/embedding-mi... · Posted by u/kiyanwang

ipsum2 · 15 hours ago

What does it mean to generate a 1000-element float16 array from a 32k context? Surely the embedding you get is no longer representative of the text.

xfalcox · 14 hours ago

Depends on your needs. You don't want 32k-long chunks for a standard RAG pipeline, that's for sure.

My use case is basically a recommendation engine, where I retrieve a list of similar forum topics based on the one currently being read. As with any dynamic user-generated content, a topic can vary from 10 to 100k tokens. Ideally I would generate embeddings from an LLM-generated summary, but that would increase inference costs considerably at the scale I'm applying it.

Having a larger possible context out of the box meant that a simple swap of embedding models greatly increased the quality of recommendations.
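
For readers unfamiliar with this kind of setup, here is a minimal sketch of a "similar topics" lookup over precomputed embeddings. It is an illustration only, not the implementation described above: the topic ids, the 3-dimensional toy vectors, and the function names are all made up, and a real deployment would use the model's 1024-dimensional vectors and a vector index rather than a linear scan.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def similar_topics(current_id, embeddings, k=3):
    """Return the ids of the k topics most similar to the one being read."""
    query = embeddings[current_id]
    scored = [
        (cosine(query, vec), topic_id)
        for topic_id, vec in embeddings.items()
        if topic_id != current_id  # never recommend the topic itself
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [topic_id for _, topic_id in scored[:k]]

# Hypothetical toy data: one embedding per forum topic.
topics = {
    "gpu-inference": [0.9, 0.1, 0.0],
    "embedding-models": [0.8, 0.2, 0.1],
    "guitar-chords": [0.0, 0.1, 0.9],
}

print(similar_topics("gpu-inference", topics, k=2))
```

With one embedding per topic, a longer context window means the whole topic can be embedded in one pass instead of being truncated or summarized first.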

xfalcox commented on Embedding Text Documents with Qwen3 daft.ai/blog/embedding-mi... · Posted by u/kiyanwang

xfalcox · 17 hours ago

Just migrated all embeddings to this same model a few weeks ago at my company, and it's a game changer. Having 32k context is a 64x increase compared with our previously used model. Plus, being natively multilingual and producing very standard 1024-long arrays made it a seamless transition, even with millions of embeddings across thousands of databases.

I do recommend using https://github.com/huggingface/text-embeddings-inference for fast inference.
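
As a rough sketch of what calling a text-embeddings-inference server can look like from Python: the server exposes an `/embed` route that accepts a JSON body with an `inputs` field. The host and port below are assumptions for a local deployment, and the helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: a text-embeddings-inference server is reachable at this address.
TEI_URL = "http://127.0.0.1:8080/embed"

def build_embed_request(texts, url=TEI_URL):
    """Build (but do not send) a POST request for the /embed route."""
    payload = json.dumps({"inputs": texts}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def embed(texts, url=TEI_URL):
    """Send the request and return the parsed JSON (one vector per input)."""
    with urllib.request.urlopen(build_embed_request(texts, url)) as resp:
        return json.loads(resp.read())

# Usage, with a server running:
# vectors = embed(["first forum topic", "second forum topic"])
```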

xfalcox commented on Rerank-2.5 and rerank-2.5-lite: instruction-following rerankers blog.voyageai.com/2025/08... · Posted by u/fzliu

sroussey · 10 days ago

Not really sure why they have a HuggingFace presence.

https://huggingface.co/voyageai/rerank-2.5-lite

xfalcox · 10 days ago

Having a public tokenizer is quite useful, especially for embeddings. It allows you to do the chunking locally without going to the internet.
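
A minimal sketch of what local chunking looks like once you have token ids in hand. The whitespace split below is only a stand-in for a real tokenizer; in practice you would load the provider's published tokenizer and chunk its ids the same way. The function name and the window sizes are hypothetical.

```python
def chunk_tokens(token_ids, max_tokens, overlap=0):
    """Split a token sequence into windows of at most max_tokens,
    with `overlap` tokens shared between consecutive windows."""
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must be in [0, max_tokens)")
    step = max_tokens - overlap
    chunks = []
    for start in range(0, len(token_ids), step):
        chunks.append(token_ids[start:start + max_tokens])
        if start + max_tokens >= len(token_ids):
            break  # the last window already reaches the end
    return chunks

# Stand-in tokenizer (assumption): whitespace split instead of the real vocabulary.
tokens = "a public tokenizer lets you count tokens before calling the API".split()

for chunk in chunk_tokens(tokens, max_tokens=5, overlap=1):
    print(chunk)
```

Counting with the real tokenizer matters because the API's limits are expressed in its own tokens, not in words or characters.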

xfalcox commented on GPT-OSS vs. Qwen3 and a detailed look how things evolved since GPT-2 magazine.sebastianraschka... · Posted by u/ModelForge

cranberryturkey · 13 days ago

qwen3 is slow though. i used it. it worked, but it was slow and lacking features.

xfalcox · 13 days ago

Qwen 3 is not slow by any metric.

Which model, inference software and hardware are you running it on?

The 30B-A3B variant flies on any GPU.

xfalcox commented on Workhorse LLMs: Why Open Source Models Dominate Closed Source for Batch Tasks sutro.sh/blog/workhorse-l... · Posted by u/cmogni1

ramesh31 · 3 months ago

Flash is just so obscenely cheap at this point it's hard to justify the headache of self hosting though. Really only applies to sensitive data IMO.

xfalcox · 3 months ago

You'd be surprised how often people in enterprise can be left waiting months to get an API key approved for an LLM provider.

xfalcox commented on Storefront Web Components shopify.dev/docs/api/stor... · Posted by u/maltenuhn

blittle · 3 months ago

I'm on the dev team that built this. Happy to answer any questions!

We essentially use web components as a templating language to dynamically generate a GraphQL query to Shopify. Then render the data as text nodes inside the web components. This is powerful because the components don't include shadow roots. So you can come with your own HTML and CSS.

Most web component libraries are opinionated about design, and give you many CSS custom properties or CSS parts to customize. We tried really hard to invert that, and instead give you the design control. Most of our web components just produce a text node, with no shadow root!

There's a few exceptions, like the cart for example, where it's easier to just have an out of the box component that does it all for you `<shopify-cart>`. Though...you can actually build the entire cart component with the lower level primitives!

xfalcox · 3 months ago

This looks like a great fit for allowing people to monetize their Discourse forums, by having partner stores and plugging those in instead of ads.

Will build a quick PoC integration. How can I contact you with feedback?

xfalcox commented on Show HN: A backend agnostic Ruby framework for building reactive desktop apps codeberg.org/skinnyjames/... · Posted by u/zero-st4rs

xfalcox · 3 months ago

This looks super cool, exactly what I've been wanting to create some useful widgets! Thanks for sharing!

xfalcox commented on I analyzed chord progressions in 680k songs cantgetmuchhigher.com/p/i... · Posted by u/jnord

calibas · 4 months ago

From my experience playing guitar, the average punk or metal song is almost entirely power chords, while this data says power chords are only 5% of chords in both genres...

I thought maybe there's types of metal and punk that I don't know about, but Wikipedia, LLMs and guitar tab sites all agree with me. Punk and metal is overwhelmingly power chords, so I don't see how the data comparing chord types can be correct.

xfalcox · 4 months ago

I guess one aspect missing here is weighting more popular songs on that analysis.

I assume that the analysis is simply counting every song's chords, so an unknown band you've never heard of has the same impact as The Ramones.

I'd like to see the same graph weighted by band popularity using either YouTube or Spotify data.
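
To make the difference concrete, here is a small sketch comparing a plain chord count with a popularity-weighted one. The songs, play counts, and chord labels are invented for illustration; nothing here comes from the article's dataset.

```python
from collections import defaultdict

def chord_type_share(songs, weighted=True):
    """Share of each chord type, optionally weighting each song by its plays."""
    totals = defaultdict(float)
    for song in songs:
        weight = song["plays"] if weighted else 1.0
        for chord_type in song["chords"]:
            totals[chord_type] += weight
    grand_total = sum(totals.values())
    if not grand_total:
        return {}
    return {chord: value / grand_total for chord, value in totals.items()}

# Hypothetical data: one popular punk song and one obscure song.
songs = [
    {"title": "popular punk song", "plays": 900,
     "chords": ["power", "power", "power", "major"]},
    {"title": "obscure song", "plays": 100,
     "chords": ["major", "minor"]},
]

print(chord_type_share(songs, weighted=False))
print(chord_type_share(songs, weighted=True))
```

In this toy example the power chord share rises once the popular song counts for more, which is the effect the weighting is meant to surface.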

xfalcox commented on Cohere Launches Embed 4 cohere.com/blog/embed-4... · Posted by u/rekovacs

xfalcox · 4 months ago

No downloadable open weights?

Looks like I'll stay on [bge-m3](https://huggingface.co/BAAI/bge-m3)

xfalcox commented on Brazil's government-run payments system has become dominant economist.com/the-america... · Posted by u/jcartw

marcosdumay · 5 months ago

And there's no surprise fraud claims.

xfalcox · 5 months ago

My wife runs a small retail makeup shop on Shopify, which started before Pix, and those surprise false fraud claims almost killed the business.

Pix was such a game changer. It is perfect.