Readit News
danso · 2 years ago
Not sure how this is a new launch when it's been out of beta since 2020 [0], though good to see it get more visibility as I imagine it's not currently heavily used.

A comparison between Google's dataset search and OpenDataNetwork (which focuses on public data portals):

https://www.opendatanetwork.com/search?q=salary

https://datasetsearch.research.google.com/search?src=0&query...

Google covers more sources, though usually in the case of public sector info (e.g. salaries) I'd prefer just to see the actual sources and not a bunch of Kaggle matches thrown in.

[0] https://searchengineland.com/google-datasets-search-is-out-o...

acutesoftware · 2 years ago
I think Google has lost all credibility when it comes to keeping projects around, especially when they involve data locked into a mildly complex system without a complete migration path out.

I would be wary of investing time in learning or using any new products from them.

bluecoconut · 2 years ago
Curious about those of you reading the comments here: 1. Are you users of Google Dataset Search? 2. What other dataset search tools are you using?

At my company, we recently (last year) did a crawl of over 600 million tables (TabLib) and released it. We've done some indexing, but haven't released a public search for it yet. If people are interested, we could stand up a service pretty quickly that serves a larger number of data sources (with data directly accessible, rather than behind potential paywalls as with Google Dataset Search). We can also attach it to our Data AI products (e.g. a ChatGPT integration, to do analysis and plotting with it).

Super curious for feedback and any signal of interest in a product like this, so please let me know: reach out here or directly via email if you're interested, and tell me why!

jabroni_salad · 2 years ago
Seems like it hasn't crawled any sports-related datasets. At least not for baseball. You can get things like ticket sales but not anything like batting averages.

liminal · 2 years ago
Really wish you could filter by dataset attributes like language, number of rows, number of columns, etc.
