stayml commented on Show HN: Proxy.exposed – a simple tool for checking how anonymous a proxy is   proxy.exposed/... · Posted by u/stayml
micahdeath · 3 years ago
I like the concept, but it also looks like a harvesting site where you let people send their proxies to you. Is the code open source, and could it be self-hosted?
stayml · 3 years ago
Thanks, and good point! (That's not happening, but I can understand the concern.) I'll think about open sourcing the code so people can self-host it.
stayml commented on A self-updating list of the most current useragents   useragents.me... · Posted by u/stayml
jeroenhd · 3 years ago
How do you control for the faked user agents scrapers use? A bad scraper will be much more common in your logs than a well-behaved browser, after all.
stayml · 3 years ago
I filter out any user agents that are invalid, but there's no way to tell which are real and which are faked. The access logs include the user agent of every single site visitor, not only errors or bad actors.
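As a rough illustration of the filtering step described above, here is a minimal sketch. It assumes the access logs are in Combined Log Format and uses a made-up validity heuristic (`looks_valid`); the actual rules used by useragents.me are not stated in the thread. Note that a scraper sending a browser-like string would pass this check, which matches the point that faked agents can't be told apart.

```python
import re
from collections import Counter

# Assumption: Combined Log Format, where the user agent is the last quoted field.
LOG_TAIL = re.compile(r'"(?P<referer>[^"]*)" "(?P<ua>[^"]*)"\s*$')

def looks_valid(ua: str) -> bool:
    """Hypothetical heuristic: browser-style prefix and a plausible length."""
    return ua.startswith("Mozilla/5.0 (") and 40 <= len(ua) <= 300

def count_user_agents(lines):
    """Count occurrences of each user agent that passes the heuristic."""
    counts = Counter()
    for line in lines:
        match = LOG_TAIL.search(line)
        if match and looks_valid(match.group("ua")):
            counts[match.group("ua")] += 1
    return counts

# Made-up sample lines for illustration only.
sample = [
    '1.2.3.4 - - [10/Dec/2022:10:00:00 +0000] "GET / HTTP/1.1" 200 512 "-" '
    '"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 '
    '(KHTML, like Gecko) Chrome/108.0.0.0 Safari/537.36"',
    '1.2.3.5 - - [10/Dec/2022:10:00:01 +0000] "GET / HTTP/1.1" 200 512 "-" '
    '"python-requests/2.28.1"',
    '1.2.3.6 - - [10/Dec/2022:10:00:02 +0000] "GET / HTTP/1.1" 200 512 "-" "-"',
]

top = count_user_agents(sample).most_common()
```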
kgeist · 3 years ago
>macOS 10.15 Catalina Microsoft Edge 108

Do macOS users actually use Edge, or is it an error?

stayml · 3 years ago
Hm! It could just be a parsing error. The user agent in question is: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/108.0.0.0 Safari/537.36 Edg/108.0.1462.41
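For readers checking that string themselves, here is a small sketch of token-based browser detection. The token table and `detect_browser` are illustrative, not the site's parser. Chromium-based Edge appends an `Edg/` token after the `Chrome/` and `Safari/` tokens, so the most specific token has to be tested first.

```python
import re

UA = ("Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 "
      "(KHTML, like Gecko) Chrome/108.0.0.0 Safari/537.36 Edg/108.0.1462.41")

# Order matters: Edge also carries Chrome/ and Safari/ tokens,
# and Chrome also carries a Safari/ token.
BROWSER_TOKENS = [
    ("Microsoft Edge", re.compile(r"\bEdg/(\d+)")),
    ("Firefox", re.compile(r"\bFirefox/(\d+)")),
    ("Chrome", re.compile(r"\bChrome/(\d+)")),
    ("Safari", re.compile(r"\bVersion/(\d+).*\bSafari/")),
]

def detect_browser(ua: str):
    """Return (name, major_version) for the first matching token, else None."""
    for name, pattern in BROWSER_TOKENS:
        match = pattern.search(ua)
        if match:
            return name, int(match.group(1))
    return None

print(detect_browser(UA))  # ('Microsoft Edge', 108)
```

Under this reading the string is consistent with Edge 108 running on macOS, though whether the site's own parser works this way is an assumption.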
modeless · 3 years ago
Where are Android and iOS?
stayml · 3 years ago
Coming soon! I'm keeping them separate for now, as most scraping tasks require either desktop or mobile user agents, not both together.
black3r · 3 years ago
The OS column comes off as a bit deceptive, because Windows 11 still presents as "Windows NT 10.0" and newer versions of macOS still present themselves as "Intel Mac OS X 10_15_7", even if you're on an ARM chip.

If you're parsing this info into the OS column, you should probably display it with at least a note, or as something like "Windows 10/11" and "macOS 10.15 or newer".

stayml · 3 years ago
Good point, thanks. I'll add that in.
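One way the suggested labels could be produced is sketched below. The function name `os_label` and the exact label strings are assumptions for illustration, not what the site ships. The idea is to report a range wherever the user agent no longer distinguishes versions.

```python
import re

def os_label(ua: str) -> str:
    """Map a desktop user agent to a deliberately coarse OS label."""
    # Windows 10 and Windows 11 both report "Windows NT 10.0".
    if "Windows NT 10.0" in ua:
        return "Windows 10/11"
    # The macOS version appears as 10_15_7 (or 10.15 in Firefox).
    match = re.search(r"Mac OS X (\d+)[_.](\d+)", ua)
    if match:
        major, minor = int(match.group(1)), int(match.group(2))
        if (major, minor) >= (10, 15):
            # Newer releases keep reporting 10_15_7, so only a lower bound is known.
            return "macOS 10.15 or newer"
        return f"macOS {major}.{minor}"
    return "Unknown"

edge_on_mac = ("Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 "
               "(KHTML, like Gecko) Chrome/108.0.0.0 Safari/537.36 Edg/108.0.1462.41")
print(os_label(edge_on_mac))
```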
playingalong · 3 years ago
AFAICT they are not trying to provide current browser usage stats. They only want a list of popular browsers, which is a narrower goal. The last paragraph about scraping seems to indicate that.

So they only need a UA that is reasonable as of this week. They don't need the most popular one.

stayml · 3 years ago
Yes, this too. It should just be a -passable- sample of what's popular and seen on the web.
dj_mc_merlin · 3 years ago
First of all, good job on the project. I do not mean to be too negative, but this:

> The site updates weekly with data sourced from the server access logs of another site I run in order to give an accurate picture of the devices and browsers being used on the web.

If you only source data from one website, it is not an accurate picture by any means. Consider that other websites have different user bases. Your Firefox usage seems way too high, for example. It's likely your other website is largely used by technical people.

stayml · 3 years ago
Thanks! And yep, fair comment. I had noticed this as well, even more so in last week's list. I have been thinking about how I could adjust the numbers to counteract this, or add another data source.

u/stayml

Karma: 165 · Cake day: November 15, 2022