Readit News logoReadit News
jankovicsandras commented on 7 Years, 2 Rebuilds, 40K+ Stars: Milvus Recap and Roadmap   milvus.io/blog/milvus-exc... · Posted by u/Fendy
gardnr · 3 days ago
pgVector is great and so is FAISS, but those are just a subset of what you get from Milvus. If all you need to do is RAG over 50Mb of documents then pick the right tool for the job. I use Chroma for a lot of projects.

Then, what if you want hybrid search, or different IVF variants, or disk-based search, or horizontal scaling, or something that leverages SIMD, or sparse vectors? Milvus is great.

jankovicsandras · 3 days ago
You can do hybrid search in Postgres.

Shameless plug: https://github.com/jankovicsandras/plpgsql_bm25 BM25 search implemented in PL/pgSQL ( Unlicense / Public domain )

The repo includes plpgsql_bm25rrf.sql : PL/pgSQL function for hybrid search ( plpgsql_bm25 + pgvector ) with Reciprocal Rank Fusion; and Jupyter notebook examples.

jankovicsandras commented on Show HN: Fanfa – Interactive and animated Mermaid diagrams   fanfa.dev/... · Posted by u/bairess
jankovicsandras · 10 days ago
Looks cool!

Here's some feedback:

- the diagram is not centered and zoomed by default, this is easy to fix

- it would be great to have better animation controls, like progressbar, play half speed, etc.

- it would be great to be able to export animation to video, animgif, etc. e. g. with ffmpeg.js

jankovicsandras commented on The Case Against PGVector   alex-jacobs.com/posts/the... · Posted by u/tacoooooooo
jankovicsandras · 2 months ago
Shameless plug: https://github.com/jankovicsandras/plpgsql_bm25 BM25 search implemented in PL/pgSQL ( Unlicense / Public domain )

The repo includes plpgsql_bm25rrf.sql : PL/pgSQL function for Hybrid search ( plpgsql_bm25 + pgvector ) with Reciprocal Rank Fusion; and Jupyter notebook examples.

jankovicsandras commented on Production RAG: what I learned from processing 5M+ documents   blog.abdellatif.io/produc... · Posted by u/tifa2up
pamelafox · 2 months ago
At Microsoft, that's all baked into Azure AI Search - hybrid search does BM25, vector search, and re-ranking, just with setting booleans to true. It also has a new Agentic retrieval feature that does the query rewriting and parallel search execution.

Disclosure: I work at MS and help maintain our most popular open-source RAG template, so I follow the best practices closely: https://github.com/Azure-Samples/azure-search-openai-demo/

So few developers realize that you need more than just vector search, so I still spend many of my talks emphasizing the FULL retrieval stack for RAG. It's also possible to do it on top of other DBs like Postgres, but takes more effort.

jankovicsandras · 2 months ago
"It's also possible to do it on top of other DBs like Postgres, but takes more effort."

Shameless plug: plpgsql_bm25: BM25 search implemented in PL/pgSQL (The Unlicense / PUBLIC DOMAIN)

https://github.com/jankovicsandras/plpgsql_bm25

There's an example Postgres_hybrid_search_RRF.ipynb in the repo which shows hybrid search with Reciprocal Rank Fusion ( plpgsql_bm25 + pgvector ).

jankovicsandras commented on CARA – High precision robot dog using rope   aaedmusa.com/projects/car... · Posted by u/hakonjdjohnsen
jankovicsandras · 5 months ago
Very cool robot dog and interesting video! Can the dog climb stairs? Isn't capstan drive temperature sensitive, e.g. the ropes will be shorter in cold and longer in warm wheather?
jankovicsandras commented on Show HN: Hyvector – A fast and modern SVG editor   hyvector.com... · Posted by u/jansan
iFire · 7 months ago
The first thing that came to mind was my friend's project to do a SVG editor in Godot Engine https://www.godsvg.com/.

I'll write my first impressions of https://github.com/hyvectorapp so it helps usability and improvement.

Oh it's a freemium app.

hyvectorapp starts off looking like figma, penpot style which is a good sign.

I can export svg [x]

There's no align to grid system.

vector tracing is not generally solved except via the vectormagic product and machine learning research prototypes. I wonder how you solved it.

Can't interact more today so I'll end with this note of hopefulness.

jankovicsandras · 7 months ago
Can this solve vector tracing? https://github.com/jankovicsandras/imagetracerjs (Public domain) Discaimer: I made this.
jankovicsandras commented on Ask HN: Alternatives to Vector DB?    · Posted by u/tmaly
jankovicsandras · 9 months ago
Postgres is a good idea.

Shameless plug: https://github.com/jankovicsandras/plpgsql_bm25

BM25 search implemented in PL/pgSQL, there's also an example of Hybrid (BM25+pgvector) search in the repo.

jankovicsandras commented on BM25 in PostgreSQL   blog.vectorchord.ai/vecto... · Posted by u/gaocegege
jankovicsandras · 10 months ago
This looks cool!

Shameless plug:

https://github.com/jankovicsandras/plpgsql_bm25

BM25 search implemented in PL/pgSQL, might be useful if one can't use Rust extensions with Postgres, e. g. hosted Postgres without admin rights.

jankovicsandras commented on I wrote a static web page and accidentally started a community (2023)   localfirstweb.dev/blog/20... · Posted by u/mooreds
Dwedit · 10 months ago
Hey browser makers, please allow file:// URLS to actually be able to load other files in the same directory without giving a CORS error. You can't even run a JS file from the same directory! That's what's really killing "local first".
jankovicsandras · 10 months ago
I think the following works in Firefox. WARNING: this is a security risk obviously.

about:config -> security.fileuri.strict_origin_policy -> false

source: https://stackoverflow.com/questions/58067499/runing-javascri...

jankovicsandras commented on Building a full-text search engine in 150 lines of Python code (2021)   bart.degoe.de/building-a-... · Posted by u/matt_daemon
jankovicsandras · a year ago
This is a good intro to text search. Shameless plug: If you throw in a bit more, ca. 250 SLOC, you can have BM25 search: https://github.com/jankovicsandras/bm25opt

u/jankovicsandras

KarmaCake day146March 4, 2016View Original