Not sure how this is a new launch when it's been out of beta since 2020 [0], though good to see it get more visibility as I imagine it's not currently heavily used.
I think Google has lost all trust when it comes to keeping projects around, especially when they involve data locked into a mildly complex system without a complete migration path out.
I would be wary of investing time in learning or using any new products from them.
Curious, for those who are reading comments here:
1. Are you users of Google Dataset Search?
2. What other dataset search tools are you using?
At my company, we recently (last year) did a crawl of over 600 million tables (TabLib) and released it. We've done some indexing, but haven't released a public search for it yet. If people were interested, we could stand up a service pretty quickly that serves a larger number of data sources (with data directly accessible, rather than behind potential paywalls like Google Dataset Search). We can also attach it to our Data AI products (e.g. a ChatGPT integration, to do analysis and plotting with it).
Super curious for feedback and any signal of interest in a product like this, so please let me know - reach out here or directly via email if you're interested, and tell me why!
Seems like it hasn't crawled any sports-related datasets, at least not for baseball. You can get things like ticket sales but nothing like batting averages.
A comparison between Google's dataset search and OpenDataNetwork (which focuses on public data portals):
https://www.opendatanetwork.com/search?q=salary
https://datasetsearch.research.google.com/search?src=0&query...
Google covers more sources, though usually in the case of public sector info (e.g. salaries) I'd prefer just to see the actual sources and not a bunch of Kaggle matches thrown in.
[0] https://searchengineland.com/google-datasets-search-is-out-o...
2021
https://news.ycombinator.com/item?id=27068551
2020
https://news.ycombinator.com/item?id=22130874
2018
https://news.ycombinator.com/item?id=17919297