Internet search tips - Readit News

+1 for his tip of "Search The Internet Archive".

The other day I searched for the Tom & Jerry full episodes on the web to no avail (streaming platforms and video platforms like youtube only have rubbish cuts).

The internet archive has every episode starting from the first one in 1940, in an easily accessible player without any ads or recommendations: https://archive.org/details/tom-and-jerry-all-114-episodes

nomilk · 2 years ago

TIL: IA is more than Wayback Machine.

Something I'd love to learn to do better is search WBM. I use WBM only a couple of times per month, but when I need it it's the only tool that can do the job and is therefore very valuable. Trouble is, I don't really know how to search it unless I have a record of the exact URL I want, which isn't always possible.

andrepd · 2 years ago

Your comment just reminded me of how great the Internet Archive is, so I guess it's time to donate another 100 bucks to IA again.

WaffleIronMaker · 2 years ago

Just as a quick link for others, it's possible to set up a single or monthly donation to the internet archive at this link: https://archive.org/donate/

bookofjoe · 2 years ago

I give from time to time to Wikipedia, when I get an appeal from Jimmy Wales. Even more important than IA to me, which is saying A LOT.

thakoppno · 2 years ago

it’s within the realm of a few apple stores to serve the internet archive?

ghghgfdfgh · 2 years ago

Internet Archive is generally invaluable for finding older media. I personally use it to find movies from the 40’s and earlier. There’s always a dozen different uploads of the movie regardless of whether it’s still under copyright protection. I guess the rights owners don’t care that much, but it could potentially be ammo to take the whole Archive down.

my problem is some operators just dont work any more, either on purpose or because of crappy quality control. for example, you used to be able to do:

    allintitle:Neil Diamond If You Go Away

on YouTube, and get exactly what you would think, results with all those words in the title. but now, you dont:

https://youtube.com/results?search_query=allintitle:Neil+Dia...

now, I get crap like this:

    Neil Diamond & Shirley Bassey - Play Me - "high quality"
    Barbra Streisand - If You Go Away (Ne Me Quitte Pas)

how is that what I searched for? also, what is this:

> A search for [site:nytimes.com] will work, but [site:nytimes.com] won't.

https://support.google.com/websearch/answer/2466433

did I just have a stroke? those two searches are exactly the same. I try to be understanding, but I am constantly tripping over big companies glaring software and/or documentation issues, it gets old.

hddqsb · 2 years ago

> https://support.google.com/websearch/answer/2466433

> those two searches are exactly the same

Yes, that appears to be a recently-introduced typo -- the archived version from April does have a space: https://web.archive.org/web/20230412181331/https://support.g...

I submitted a feedback comment, hopefully they'll fix the typo.

(For future reference, here is a snapshot of the current version without a space: https://web.archive.org/web/20230722085337/https://support.g...)

bdn_ · 2 years ago

Has anyone else experienced DuckDuckGo ignoring the exclusion operator? For example, searching `kiwi -fruit`, with no space between the hyphen and second word, used to bring up results that did not include the word "fruit". This no longer seems to be the case.

mdp2021 · 2 years ago

> duckduckgo ... exclusion operator

Removed a few weeks ago.

Somebody posted in these pages the github diff showing the removal of the options.

lloydatkinson · 2 years ago

Someone needs to build a meta search engine…

chkal · 2 years ago

> did I just have a stroke? those two searches are exactly the same.

I just had the exact same though while reading the page.

nomilk · 2 years ago

The fact Google's own documentation is this poorly kept is a astonishing.

Makes me a firm believer companies should have their documentation on GitHub (or similar) so anyone can make a PR to tidy these things up.

pdanpdan · 2 years ago

The one not working has a space after the colon. It's even in the section taking about this :)

38 · 2 years ago

no, it doesn't.

Daub · 2 years ago

Agree. This even applies to simple Booleans, which are sometimes completely ignored.

asteroidz · 2 years ago

I sometimes wonder about PMs who sign off on decisions like these, and the seeming lack of protest the developers put up.

Jakob · 2 years ago

orphea · 2 years ago

I feel like Google became absolutely unusable as a search engine. You search for "keyword1 keyword2", and at least half of the results are either "Missing: keyword1" or "Missing: keyword2".

ImaCake · 2 years ago

Until a year ago using DuckDuckGo instead of google felt like only an equal or inferior option. But at some recent point I have found that DDG has slightly improved and Google has gotten much, much worse. With DDG and Bing Chat, google's future is looking very Internet Explorer 6.

flyinghamster · 2 years ago

DDG is following Google's path, with removal of the exclusion operator. Nothing like searching for "foo -quux" and finding "quux" in ALL of my search results, on DDG and Google alike.

discobean · 2 years ago

Make your tool so idiots can use it and only idiots will use it

mrweasel · 2 years ago

Google is especially bad that this. Often what will be remove is the most important keyword, I assume because that yields more result.

cma · 2 years ago

They moved over to really dumb word vector stuff, they mention it in the TPUv4 paper and it is pretty surprising but probably monetized better somehow.

costco · 2 years ago

Really good list. Here are some others I've discovered:

Some libgen clone sites like z-lib have fulltext search on books with support for exact matches: https://zlibrary-asia.se/fulltext/?q=%22frank+sinatra%22&typ...

Even if you are going to purchase book on a subject, this finds so much stuff that is not in Google because of copyright delisting and is sometimes useful in knowing which books to consider.

Yandex results especially the non English ones can be good if you are willing to use a translator.

If I had my way, I would introduce internet search techniques as a core module in all university programs.

Too often I see in my students work evidence of lazy searching. It is as if they expect Google to be able to read their minds or even foresee their future intentions. Lack of variety of search terms is a key shortcoming. Also, lack of exploration of terms which are tangentially related to their search topic.

An internet search should be playful and exploratory. Above all, it should be understood that the internet is beyond simple linear indexing.

aardshark · 2 years ago

But search techniques change over time. You can see that in the responses in this thread. What used to work doesn't anymore.

userbinator · 2 years ago

It is as if they expect Google to be able to read their minds or even foresee their future intentions.

That's exactly what Google and other interests want --- gullible, uncritical sheeple that can be exploited to extract $$$ and worse.

hattmall · 2 years ago

So I actually did have this as a class, or at least part of one.

I think there is / was an official standard or name, I can't remember it though. It never worked with Google really, but it worked on the search engines for our university and other academic sites.

skimdesk · 2 years ago

Another useful Google search trick not mentioned in the article is numeric range queries.

You can use two numbers separated by two dots to represent all numbers in the range.

For example, a search for

  taki 100..200

gives me results for taki 183.

This is useful when you can't remember an exact year or number.

dang · 2 years ago

Internet Search Tips - https://news.ycombinator.com/item?id=26847596 - April 2021 (77 comments)

Internet Search Tips - https://news.ycombinator.com/item?id=18666574 - Dec 2018 (28 comments)

coldblues · 2 years ago

Gwern is the person I'd become if I could properly monetize my ADHD and rabbit hole seeking behavior. I absolutely love his website and everything he publishes, and I am always delighted to look over his new stuff. Wish there are more people like him to read from.

RGBCube · 2 years ago

His website is the best designed website I have genuinely ever visited, so fast, easy to navigate and very little blank space which makes it information dense. I also love the inline link opening, I definitely will be implementing that when I make my own website in the future.

wintermutestwin · 2 years ago

>His website is the best designed website

The forced R and L margins suck for people with oldster eyes who have to increase the text size.

Kids these days!

jterrys · 2 years ago

Are you perchance able to provide some kind of RSS feed to his website? I'm having a hard time finding his newest stuff. You can subscribe to his newsletters but he stopped doing those two years ago

Tenoke · 2 years ago

There is the firehose from his patreon though that might not be exactly what you are looking for. Personally, I just check 'newest' on his frontpage every now and then.

https://www.patreon.com/gwern