Readit News logoReadit News
kashyapa95 commented on Show HN: Librarian - Semantic Bookmark Search Using Transformers   github.com/oto-labs/libra... · Posted by u/kashyapa95
smusamashah · 2 years ago
Had to move manifest.json from public dir to src dir and load src dir in chrome.

EDIT: And the popup doesn't do anything at all. Can see no js included in popup.html.

kashyapa95 · 2 years ago
my bad, forgot to mention you need run npm build. Added instructions here: https://github.com/oto-labs/librarian?tab=readme-ov-file#set...
kashyapa95 commented on Show HN: Librarian - Semantic Bookmark Search Using Transformers   github.com/oto-labs/libra... · Posted by u/kashyapa95
smusamashah · 2 years ago
How do I install this?
kashyapa95 · 2 years ago
You'll have to do this for now: https://developer.chrome.com/docs/extensions/get-started/tut...

we were gonna get it published on the web store after some initial feedback

kashyapa95 commented on Show HN: Librarian - Semantic Bookmark Search Using Transformers   github.com/oto-labs/libra... · Posted by u/kashyapa95
iansinnott · 2 years ago
Curious how this works. I've experimented with in-browser vector search using victor[1] with mixed results. Hadn't heard of this orama lib before checking out your project.

[1]: https://github.com/not-pizza/victor

kashyapa95 · 2 years ago
So the extension scrapes all your bookmarks' content, selects key parts of it (we have a naive heuristic for now), embeds them using Sentence Transformer, and indexes them in the browser local storage with Orama's vector DB. When you want to search, we embed the query and do a vector search against the index to get the semantically most similar ones. All in-browser so no data going to any API.

Didn't try victor, is that just for nodejs runtime or does it run at the edge as well? Orama's been pretty good, at least semantically. Haven't done any speed benchmarking so not sure if it's as fast as say HNSW.

kashyapa95 commented on Show HN: Librarian - Semantic Bookmark Search Using Transformers   github.com/oto-labs/libra... · Posted by u/kashyapa95
mshekow · 2 years ago
Sounds really interesting, and I'd also love a Firefox version :)
kashyapa95 · 2 years ago
Will try to make one soon!
kashyapa95 commented on Show HN: Librarian - Semantic Bookmark Search Using Transformers   github.com/oto-labs/libra... · Posted by u/kashyapa95
visarga · 2 years ago
This is great but I want just simple full text search on all the history. Not title and url search, but full text. If it has semantic embeds on top, all the better. I am losing too many of the things I find.

Wondering why browsers neglected bookmarks and search history so much. They never progressed in the last 2 decades. Storage is cheap, computers are fast and multi-core, yet we live with the mentality of paucity and don't save our digital crumbs.

kashyapa95 · 2 years ago
Thank you! Yeah history is something we've been asked multiple times now. I'm sure this could be extended easily once we solve a few things (scraping pages faster and being smarter about what text we embed, parsing out irrelevant stuff). Will keep you posted.
kashyapa95 commented on Show HN: Librarian - Semantic Bookmark Search Using Transformers   github.com/oto-labs/libra... · Posted by u/kashyapa95
klavinski · 2 years ago
I also built such an extension half a year ago [0]. The first iteration was a local-first natural-language full-text search for the browser history [1]. The second iteration was focusing on bookmarks [2].

None of these could spark enough interest to get feedback on what users want. I am sharing this experience so that you may study my attempt, if you want.

[0] https://getpinbot.com/

[1] https://www.youtube.com/watch?v=GYwJu5Kv-rA

[2] https://www.youtube.com/watch?v=PQh1qhvxZzc

kashyapa95 · 2 years ago
Thank you! Will take a look at these. We mostly made this to solve something for ourselves and as a learning exercise, but hoping this may resonate with others too

u/kashyapa95

KarmaCake day29December 19, 2021
About
https://twitter.com/itsakshayyyyy
View Original