Readit News logoReadit News
Kortaggio · 4 months ago
This write-up about the site is also fascinating: https://pudding.cool/2025/07/street-view/
Antrikshy · 4 months ago
The Pudding is one of the best things on the internet today.
dang · 4 months ago
Added to top text. Thanks!
archon810 · 4 months ago
Holy crap, I didn't know sites like these still exist on the web.

Deleted Comment

deanc · 4 months ago
This would be an interesting additional layer for google maps search which I often find to be lacking. For example, I was recently travelling in Gran Canaria and looking for places selling artesan coffee in the south (spoiler: only one in a hotel which took me almost half an hour to even find). Searching for things like "pourover" and "v60" is usually my go-to signal but unless the cafe mentions this in their description or its mentioned in reviews it's hard to find. I don't think they even index the text on the photos customers take (which will often include the coffee menu behind the cashier).
robertlagrant · 4 months ago
Seems like searching for V60 would get you a lot of Volvos! Is anyone photographing these words in coffee shops that would let them be surfaced here?
deanc · 4 months ago
Yeah, that can be somewhat of a problem in bigger cities ;-) It's pretty common for people to have taken a photo of the menu in cafes but as mentioned it seems google isn't ingesting or surfacing that information for text search.
mockingloris · 4 months ago
It could be. If they didn't think about it, now they can.

Could easily seeing myself come back to this.

└── Dey well; Be well

m_kos · 4 months ago
GitHub of the person who prepared the data. I am curious how much compute was needed for NY. I would love to do it for my metro but I suspect it is way beyond my budget.

https://github.com/yz3440

(The commenters below are right. It is the Maps API, not compute, that I should worry about. Using the free tier, it would have taken the author years to download all tiles. I wish I had their budget!)

LeifCarrotson · 4 months ago
I would wager the compute for the OCR is cheap. Just get a beefy local desktop PC, if it runs overnight or even takes a week that's fine.

It's the Google Maps API costs that will sink your project if you can't get them waived as art:

https://mapsplatform.google.com/pricing/

Not sure how many panoramas there are in New York or your metro, but if it's over the free tier you're talking thousands of dollars.

daemonologist · 4 months ago
The linked article mentions that they ingested 8 million panos - even if they're scraping the dynamic viewer that's $30k just in street view API fees (the static image API would probably be at least double that due to the low per-call resolution).

OCR I'd expect to be comparatively cheap, if you weren't in a hurry - a consumer GPU running PaddlePaddle server can do about 4 MP per second. If you spent a few grand on hardware that might work out to 3-6 months of processing, depending on the resolution per pano and size of your model.

swores · 4 months ago
Their write up (linked at top of page below main link, and in a comment) says:

> "media artist Yufeng Zhao fed millions of publicly-available panoramas from Google Street View into a computer program that transcribes text within the images (anyone can access these Street View images; you don’t even need a Google account!)."

Maybe they used multiple IPs / devices and didn't want to mention doing something technically naughty to get around Google's free limits, or maybe they somehow didn't hit a limit doing it as a single user? Either way, it doesn't sound like they had to pay if they only mention not needing an account.

(Or maybe they just thought people didn't need to know that they had to pay, and that readers would just want the free access to look up a few images, rather than a whole city's worth?)

ks2048 · 4 months ago
It says 8 million images. So, 13.2 images/second for one week.

I'm wondering about more the data - did they use Google's API or work with Google to use the data?

puppymaster · 4 months ago
i just hashout out the details with claude. apparently it would cost me ~8k USD to retrieve all Taipei street images from gmap api with 3m density. Expensive, but not impossible.
ge96 · 4 months ago
Tangent, there are these videos on YT of people walking through cities, the ones I like in particular are through Tokyo/Japan. I was thinking it would be cool to build a 3D map from that, it is possible but not my field. I think some companies have done it too. But there is a lot of data on that. Maybe free robot training (walking through a crowd like delivery).

I believe it's a combo of SLAM/photogrammery/VIO but you don't have an IMU so that part would have to be estimated from the video. Maybe the flickering of the lights with the frames probably too fast.

ex. https://youtu.be/ohlzQNCpT7M?si=zH764fDlHqPKyjin&t=537 ex. https://www.youtube.com/watch?v=UZi2GeEGdvM

at-fates-hands · 4 months ago
There was a guy a long time ago, who did YT videos of the tech markets in Tokyo and it was really surprising some of the best places to get parts for smartphones or robots were completely non-descript buildings in the heart of the city. He specifically went to places that most people wouldn't know about unless you really had great local information.

If someone were to do what you're saying, it would be a huge win for people visiting and being able to find these places. I would love to see this.

ge96 · 4 months ago
You reminded me of Strange Parts who was in China, able to pick up random stuff like an iPhone motherboard from a lady selling it on the street.
quinncom · 4 months ago
Do you think those videos of tech markets in Tokyo are still online? I would enjoy seeing them.
pimlottc · 4 months ago
Similarly it would be great to have a tool to do it with stills, like reconstruct a floor plan based on real estate photos. Even if it were partially manually, it would be pretty handy.
ge96 · 4 months ago
Matterport seems to do this at least offering you a 3D tour of say an apt complex

edit: although this is not what you're describing, this is literally using a 360 camera

Apple's Room Plan is pretty legit measuring walls/objects in a room but also requires being in the room/moving it around

baby · 4 months ago
Interesting how they censor the word "fuck" like it's going to affect your brain if you read it fully spelled or something
sksrbWgbfK · 4 months ago
Is it? I can lookup that word and see it in the pictures. Or is it the StreetView version that has been censored somewhere?
yorwba · 4 months ago
The pudding.cool article has a link labeled "View the map of “F*ck”" but it leads to a search for "fuck" instead. If you search for "F*ck", you find gems such as "CONTRACTOR F CK-UP" https://www.alltext.nyc/panorama/KhzY08H72wV2ldXamZU5HA?o=76... (Strategically placed pole obscuring the word.)
vgb2k18 · 4 months ago
SEO, or family friendly values (maybe both!). Related: no swearing in the first minute of YouTube videos.
rancidcrab · 4 months ago
That's been changed (again). Iirc most swear words are now fine wherever they are in the vid.
baby · 4 months ago
Is that a youtube policy? It's so weird.

Dead Comment

dang · 4 months ago
Related. Others?

All Text in NYC - https://news.ycombinator.com/item?id=42367029 - Dec 2024 (4 comments)

All text in Brooklyn - https://news.ycombinator.com/item?id=41344245 - Aug 2024 (50 comments)

dfworks · 4 months ago
I have a London one also if anyone is interested!

https://london.publicinsights.uk

dang · 4 months ago
Nice! If you want to email hn@ycombinator.com we could send you a repost invite for https://news.ycombinator.com/item?id=44664046 - but please wait a while first. The trick is to let enough time go by for the hivemind caches to clear. Then everything old becomes new again :) - usually 2-3 months is a good interval...
jjwiseman · 4 months ago
This is a super cool project. But it would be 10x cooler if they had generated CLIP or some other embeddings for the images, so you could search for text but also do semantic vector search like "people fighting", "cats and dogs, "red tesla", "clown", "child playing with dog", etc.
wilson090 · 4 months ago
This would probably make John Wilson's job a lot easier (https://en.wikipedia.org/wiki/How_To_with_John_Wilson)