Readit News logoReadit News
optimalsolver · 4 years ago
Calvin & Hobbes & Dune (Calvin & Muad'Dib):

https://calvinanddune.tumblr.com/

Arubis · 4 years ago
This is incredible; thank you!
sp332 · 4 years ago
This page is great and I've used it for years. One thing to note is the underlying data (from gocomics) is not always accurate, and sometimes you will end up on a comic that does not match the search. Sometimes you can find the one you're looking for by browsing about a week out in either direction. If you email Yingling about it with the correct date or URL, he'll update the database.
gk1 · 4 years ago
> Your search returned no results.Remember that it searches for the EXACT phrase you typed! (And dates must be of the format MM/DD/YYYY)

Requiring an exact search is fairly prohibitive. For example I searched for "feeling frustrated" and got the above error, despite 100% certainty there are strips about feeling frustrated. You (or someone) could update this to search by semantics and not by keyword, using a combination of SBERT (or similar model) + a vector database: https://www.pinecone.io/docs/examples/semantic-text-search/

marginalia_nu · 4 years ago
I think this is serious overkill, like you can get a way with simple stemming.
snapdeus · 4 years ago
I believe there is built in stemming in this search.

I searched the key word "commit" and some results were returned with the word "committed"

My guess is this guy just set up a simple DB query on the search form and left it at that.

That's what I've done in the past for simple text search.

I know that at least mongodb comes with built in full text search and stemming.

What he really should do is set up elastic search but that's a big pita to learn all that

leobg · 4 years ago
Or lemmatization and BM25. Often better than SBERT in my experience when using it out of the box.
dakial1 · 4 years ago
Felt the same way. Some off the shelf algorithms would do wonders here. More sophisticated ML to tag (image and themes) would be awesomely overkill.
bombcar · 4 years ago
Or allow people to tag strips, etc.
dylan604 · 4 years ago
shudders at the thought of what "people" would contribute
dang · 4 years ago
Related:

Calvin and Hobbes Search Engine - https://news.ycombinator.com/item?id=26119380 - Feb 2021 (163 comments)

Calvin & Hobbes Search Engine - https://news.ycombinator.com/item?id=1600211 - Aug 2010 (27 comments)

Calvin & Hobbes "Search Engine" - https://news.ycombinator.com/item?id=1600151 - Aug 2010 (5 comments)

fzliu · 4 years ago
I remember this from a while back - good to bring back memories. This may be a bit excessive, but using CLIP (https://towhee.io/image-text-embedding/clip) or some other multimodal learning model in conjunction with Milvus (https://milvus.io) could make for a search engine that takes the graphics into consideration as well. Not sure how well it would work for comics, but it could make for a pretty unique weekend hack.
unfunco · 4 years ago
I searched for "Somehow, it's always right now until it's later." and it found nothing, I checked on http://www.s-anand.net/comic.calvin.jsz and it's "Somehone, it's always right now until it's later." – it's not very forgiving.
teddyh · 4 years ago
jkingsman · 4 years ago
If anyone else is looking for the dataset, it appears here: http://www.s-anand.net/comic.calvin.jsz
irrational · 4 years ago
I don't think this is correct. For instance, if I search for Susie, it finds Susie comics even when she isn't mentioned by name.

For instance, the first comic with her has this information

Script Here comes that new girl. Hey Susie Derkins, is that your face, or is a 'possum stuck in your collar? I hope you suffer a debilitating brain aneurysm, you freak! She's cute, isn't she?? Go away. Description Calvin sees the new girl coming and yells a question asking if that's Susie Derkins' face or if a possum is stuck in her collar. He then yells that she should have a debilitating brain aneurysm. Hobbes says she's cute, to which Calvin wants Hobbes to go away.

None of which is in the dataset you listed.