Besides the humor of how Lyft's filter flagged traditionally Caucasian names like "Cummings" – in addition to the usual issues with non-Western names, e.g. Pimpong and Poon – I'm fascinated/confused how this made it into production? The user database already exists and was currently being used by the live application. Before deploying this new filter onto the production database, wouldn't you do a dry run to get not only the count of users who will be flagged and notified, but a listing of frequently flagged names? Which you could easily manually eyeball to make sure there weren't obvious false positives?
With a userbase as big as Lyft's, I'm sure there were a ton of obvious true positives (anyone named "Fuck", ostensibly). I just can't believe they didn't notice a surname as relatively common as Cumming/Cummings. Yes, the apparent naivety of the regex is an issue, but this seems like a system for which a lot of testing on actual data would be easy and natural to do as part of the QA process.
It always amazes me when US Americans talk in terms of human races like it's the most normal thing in the world. And no, “Cummings“ is not a “Caucasian name“ (whatever that is supposed to mean), it's an English name. I don't want sound too harsh, it's just that as a Central European with all our recent history, things like this really rub me the wrong way.
As an American I agree and this was also my reaction to the comment.
There are a number of these things, though. I personally get a little irritated with the way many people here use terms like "Hispanic" to contrast with "white people". To me, "Hispanic" is a wide spectrum of ethnicities but really more of a linguistic, cultural and national/political specifier, with lots of overlap with "white". People who are trying to be correct about this will pull out complicated phrases like "non-hispanic whites" which may be correct but are also clumsy, reflecting the absurdity of the way we categorize this country's social boundaries in the first place. "White" itself has had a fluid definition in the US over time, previously having excluded southern or eastern Europe, Jews, or even the Irish.
I know that in my own recent ancestry I have at least 4 European ethnicities, some of which would have been ridiculed by other Americans in the past. But this nuance washes away and I am a "white guy" which lumps me together with cultures I don't particularly identify with. There is no sense in denying, for example, that I am less discriminated against by appearance than other groups. But I do not think either that there is one homogeneous "white people" living in the United States.
Caucasian is a funny word to apply to Europeans since its a part of the area around present day Georgia and is a highly diverse ethnic mix of people which most people would classify as Central Asian (like Turks and Armenians for instance). Not exactly what you would think would be "white" which is what many Americans assume it means.
Race is not a uniquely American concept. The Germans were the ones who first came up with the idea of a Caucasian race back in the 18th century. You're not wrong though, we should use White European instead since that's what it means today.
Yeah, no name filter is going to be perfect for all edge cases. But presumably, it's a lot cheaper to have the Lyft support team handle email tickets from the relatively few Professor Fucks, than to have them also deal with every Cummings and Dick.
This is all bullshit anyway. A word is just a word and whenever it insults somebody it's their own problem. I would rather change my actual name to Fuck if I wasn't too lazy to deal with problems like that.
I once read a story of a Vietnamese man named Hui. He had to go to the court to protect his right to be named this way in Ukraine where the word is used widely and has only one meaning - a dick.
A word is just a word and whenever it insults somebody it's their own problem.
There was a study linked to on HN a couple of years ago that found that babies recognize swear words even before they have language skills, and that people can sometimes recognize swear words in languages they don't speak. It has to do with the tone and sharpness of the sounds.
> A word is just a word and whenever it insults somebody it's their own problem.
That's what you think the world should be like, not the way it is. If your name is Adolfhitler Smith, a perfectly legal name in the US, then you will have trouble getting hired.
That's a made up example, but there's evidence[1] that unusual (not necessarily obscene) names will lead to less success in job applications.
If you have a "bad" name, it's your problem well before anyone else's.
This is a nonsensical argument. A word has meaning, which is why it is a word and not a sound.
If I yell insults at you, you are saying that doesn't mean anything? And if you get insulted, it is your own fault?
No, words have meaning based on a common understanding between the person saying the word and the person hearing the word. If I insult you and you understand it, it is me who did the insulting, it is not 'your problem'
You should try this argument out in court: "No, your honor... I can't be guilty of threatening to kill someone.. sure I said, 'I am going to kill you tomorrow at 5pm', but those were just words! If the person felt threatened, that is on them!"
I just tried to create a driver's account on Lyft's website. There's an initial page where you give them your name. And then, deeper into the process, a form for supplying your name as it officially appears on your driver's license.
Which makes sense. There's likely a lot of drivers who want to go by "Tom" instead of "Thomas" or whatever their full official name is – which is even more important for drivers with very uncommon names who want to go by something more familiar/pronounceable. There's probably a decent safety argument for not requiring drivers to have their full real names be accessible to the user, especially in cases where a driver has an uncommon (i.e. easy to doxx) name.
None of the major card networks except for American Express validate cardholder name. The Address Verification Service (AVS) also only validates numeric values (so if your address is 123 Main Street, 123 Maple Ave will return a matching response)
A credit card is not required to use Lyft. You can use credit from gift cards or promo codes, as well as 1 time payment methods like Apple Pay which are completed at time of ride booking and do not expose names
I think you and the driver are supposed to exchange names. This is probably awkward when you name yourself "suck my balls" or something, which people undoubtedly do because people are terrible.
I’m guessing test and prod are highly-segregated to prevent improper data access. Remember when one of the app-based taxi companies had employees who spied on their exes via their system? I’m guessing they beefed up security to mitigate the insider threat risk.
Regardless, your point still stands. The product manager should have gone through whatever process was needed to pull down a limited set of production data to test against. Or the engineers could have logged exceptions in prod before making the filter active.
What's the upside of deploying this anyway? It's not as though it's similar to Facebook where these names are very publicly visible. It's just the drivers who would see the names.
They could just generate the list of questionable names and get support to give them a call about it or institute a rider identity verification programme.
I have a friend named Aryan who was not allowed to sign up for a Microsoft account with that name. Support even refused to make the exception when asked, suggesting that he sign up with the name "Ryan" instead.
To this day, emails from that account have "Ryan <lastname>" in the header.
> Which you could easily manually eyeball to make sure there weren't obvious false positives?
Implying devs test their code thoroughly. lol. After working at numerous companies (some with high bars of entry) I've noticed many devs push changes without testing properly. Or, if they test, they rarely consider edge cases or implications of what they're doing and only test the obvious base cases.
As someone already pointed out here, Fuck is a real last name. Fucker is also a real last name. You need a credit card to pay for Lyft; I don't quite follow why they can't just match people up with their credit card data.
It seems like tech companies have been doing DIY profanity filters. Apple has a profanity filter for engraving text onto a device in the online store (https://twitter.com/minimaxir/status/1213188371452841984 ); intended to be your name, so terms like "Dick" have to not be filtered.
I learnt that Apple has a validation endpoint. Since the engraving service was recently updated, I took a profanity wordlist and checked it against the endpoint just for fun. The results are...counterintuitive: https://pastebin.com/mzpECiQw (NSFW language, obviously)
A bit of a tangent but you reminded me-- and maybe, I'm totally full of shit or like my memory is wrong, but the iPhone predicting profanity was my favorite feature. For YEARS it would never write "duck" instead of "fuck," and generally was great, but then (and this is the like maybe I'm totally wrong part) after Jobs passed away that stopped completely... and now I get this bullshit I have to fucking correct all the time (I have had to set up shortcuts to prevent it from fixing my profanity).
I can't imagine Jobs putting up with trying to swear at someone over text and repeatedly getting "ducking."
iOS autocorrect has always been terrible, it’s not exclusive to profanity. Both in terms of its performance and experience. Its incredibly frustrating to use when it autocorrects words 3-4 words later, and navigating text fields has become significantly more difficult on recent versions of iOS.
Android is the superior experience in almost every UX category imo, and text handling is the best example of that.
By the way Semen is the way the Russian equivalent to Simon is spelled officially. There is quite a number of Russian men who have just that for their first name in their passports. I once met one, he worked as a developer and people were making fun of his name.
The Apple endpoint, inconsistent as it was, was a little more permissive than I expected. At least compared to something like the Sony Playstation ID filter; for example, 'hitler' is fine for Apple, whereas Sony seems to block out any ID containing the literal string of 'hitle', e.g. `hitle123987` [0]. Meanwhile, Apple manages to block some of the more esoteric sexual profanity that Sony's filter can miss, e.g. `bunghole`.
I'm still trying to work out what Sony's filter doesn't like about my username (zeta0134) which I've had for ages. The first bit is just the letter "z" in Spanish; there's no deeper meaning. To my knowledge it's not considered offensive in Spanish, but Sony would not let me have it no matter how many variations I tried. Maybe there's some political thing I'm not aware of?
Anyway, I switched languages and I'm "zed0134" on their network. The whole thing struck me as a bit odd.
The profanity filter in the Apple App Store was [0] amusingly wrong, but fortunately also a soft filter rather than a hard filter.
What happened was, the German localisation of the app description included the word “Knopf”. Knopf is not a rude word, according to any German I’ve discussed this with — it is one translation of “knob” in the sense of “button”, but Apple’s naughty word detector clearly thought it was “knob” in the sense of the euphemism for a body part.
It didn’t stop the app passing review, but the automatic warning was still a regular part of updates for that particular app.
Honestly if filter try to filter out euphemisms all is lost because:
1. People who use them for that just come up with new ones all the time
2. People still use the word in the regular sense, like would I now have to come up with a euphemisms to describe a door knob ?!
3. How I wore is my decision. As long as I don't hurry anyone intentionally or knowingly no person and even less company had the (moralic) right to constraint me. (Through wrt. Minirs, Y least younger ones, their parents opinion matters, too.)
This is what happens whenever someone decides that they have enough expertise in something they don't.
We see the same phenomena with encryption, authentication, and password security. People just tend to have a myopic understanding of the topic and vastly overestimate their ability in something they've barely even studied, which is the worst kind of ignorance. And they very much deserve to be named and shamed when they inevitably screw it up.
When dealing with names & addresses, consult an expert.
My last name “Hung” was blocked by Apple’s Genius Bar at one point when they displayed the names queue on the digital board. They don’t like certain methods of Chinese romanization (particularly the one used in Taiwan).
How do they not realize what an obvious shitstorm this will cause?
And who is bothered by this, really?
I find it hard to imagine that it's the drivers calling themselves Dick Cummings out of immaturity. So what does it really matter what the customers call themselves?
>I find it hard to imagine that it's the drivers calling themselves Dick Cummings out of immaturity. So what does it really matter what the customers call themselves?
That was my thought too. It's one thing if you're using obscenity in an anonymous online alias, as a great many Redditors seem to love doing, but I'm struggling to believe that it's something people do in signing up for real-life ride-sharing apps, either as a driver or rider.
Obviously there must be some amount of people doing it, otherwise Lyft wouldn't bother with this at all, but it's just so... unexpected.
> "Obviously there must be some amount of people doing it"
There's no data that this is true. It could simply be a project manager deciding that it's a good thing or the result of a tiny fraction of complaints.
From my experience building filters, this is one of the most frustrating tasks in software development, because getting it right is thankless and highly contextual, and if you get it wrong, the results are obvious.
Edit: Actually, it's worse bc if you get it wrong you open up your employer to getting called out for cultural insensitivity.
You can never get this 'right' and trying to is a waste of time and money. Words are just words. Trying to police what words a person has in their name, username, email, etc. is wrong and has unintended consequences. What's inappropriate to these 'decision makers' is actually the given names of some of their users.
You should never try to define 'appropriate language' and then impose your definition of that on to the world. Especially if you want to do business.
With a userbase as big as Lyft's, I'm sure there were a ton of obvious true positives (anyone named "Fuck", ostensibly). I just can't believe they didn't notice a surname as relatively common as Cumming/Cummings. Yes, the apparent naivety of the regex is an issue, but this seems like a system for which a lot of testing on actual data would be easy and natural to do as part of the QA process.
It always amazes me when US Americans talk in terms of human races like it's the most normal thing in the world. And no, “Cummings“ is not a “Caucasian name“ (whatever that is supposed to mean), it's an English name. I don't want sound too harsh, it's just that as a Central European with all our recent history, things like this really rub me the wrong way.
There are a number of these things, though. I personally get a little irritated with the way many people here use terms like "Hispanic" to contrast with "white people". To me, "Hispanic" is a wide spectrum of ethnicities but really more of a linguistic, cultural and national/political specifier, with lots of overlap with "white". People who are trying to be correct about this will pull out complicated phrases like "non-hispanic whites" which may be correct but are also clumsy, reflecting the absurdity of the way we categorize this country's social boundaries in the first place. "White" itself has had a fluid definition in the US over time, previously having excluded southern or eastern Europe, Jews, or even the Irish.
I know that in my own recent ancestry I have at least 4 European ethnicities, some of which would have been ridiculed by other Americans in the past. But this nuance washes away and I am a "white guy" which lumps me together with cultures I don't particularly identify with. There is no sense in denying, for example, that I am less discriminated against by appearance than other groups. But I do not think either that there is one homogeneous "white people" living in the United States.
Dead Comment
Dead Comment
This guy would disagree:
http://mw.eco.br/ig/prof/ReinhardtAdolfoFuck.htm
Along the same lines,
https://www.nie.edu.sg/profile/chew-shit-fun
BTW, isn't it insulting to those people to suggest that their names are insulting?
I once read a story of a Vietnamese man named Hui. He had to go to the court to protect his right to be named this way in Ukraine where the word is used widely and has only one meaning - a dick.
There was a study linked to on HN a couple of years ago that found that babies recognize swear words even before they have language skills, and that people can sometimes recognize swear words in languages they don't speak. It has to do with the tone and sharpness of the sounds.
That's what you think the world should be like, not the way it is. If your name is Adolfhitler Smith, a perfectly legal name in the US, then you will have trouble getting hired.
That's a made up example, but there's evidence[1] that unusual (not necessarily obscene) names will lead to less success in job applications.
If you have a "bad" name, it's your problem well before anyone else's.
[1] https://www.nber.org/digest/sep03/w9873.html
If I yell insults at you, you are saying that doesn't mean anything? And if you get insulted, it is your own fault?
No, words have meaning based on a common understanding between the person saying the word and the person hearing the word. If I insult you and you understand it, it is me who did the insulting, it is not 'your problem'
You should try this argument out in court: "No, your honor... I can't be guilty of threatening to kill someone.. sure I said, 'I am going to kill you tomorrow at 5pm', but those were just words! If the person felt threatened, that is on them!"
Good luck using the N-word in the US.
Which makes sense. There's likely a lot of drivers who want to go by "Tom" instead of "Thomas" or whatever their full official name is – which is even more important for drivers with very uncommon names who want to go by something more familiar/pronounceable. There's probably a decent safety argument for not requiring drivers to have their full real names be accessible to the user, especially in cases where a driver has an uncommon (i.e. easy to doxx) name.
The Simpsons figured this out decades ago: https://www.youtube.com/watch?v=M1EjcWU6sEk
Regardless, your point still stands. The product manager should have gone through whatever process was needed to pull down a limited set of production data to test against. Or the engineers could have logged exceptions in prod before making the filter active.
They could just generate the list of questionable names and get support to give them a call about it or institute a rider identity verification programme.
The intentions can be noble, such as a Lyft employee seeing an user named “F* N*” and thinking, hey that shouldn’t be okay.
The actual consequences can be meh.
"I'm so sorry my name is offensive!"
To this day, emails from that account have "Ryan <lastname>" in the header.
Implying devs test their code thoroughly. lol. After working at numerous companies (some with high bars of entry) I've noticed many devs push changes without testing properly. Or, if they test, they rarely consider edge cases or implications of what they're doing and only test the obvious base cases.
He had his own flying car, in 1961!
https://en.wikipedia.org/wiki/The_New_Bob_Cummings_Show
http://www.comins.org/cummingsname.htm
Deleted Comment
How very dare you. I'll have you know I come from a long line of Fuckers.
Dead Comment
I learnt that Apple has a validation endpoint. Since the engraving service was recently updated, I took a profanity wordlist and checked it against the endpoint just for fun. The results are...counterintuitive: https://pastebin.com/mzpECiQw (NSFW language, obviously)
I can't imagine Jobs putting up with trying to swear at someone over text and repeatedly getting "ducking."
Android is the superior experience in almost every UX category imo, and text handling is the best example of that.
sent from my iPhone.
By the way Semen is the way the Russian equivalent to Simon is spelled officially. There is quite a number of Russian men who have just that for their first name in their passports. I once met one, he worked as a developer and people were making fun of his name.
[0] curl -X POST https://accounts.api.playstation.com/api/v1/accounts/onlineI... -H "Content-Type: application/json" -d '{"onlineId":"hitle123987","reserveIfAvailable":true}'
Anyway, I switched languages and I'm "zed0134" on their network. The whole thing struck me as a bit odd.
What happened was, the German localisation of the app description included the word “Knopf”. Knopf is not a rude word, according to any German I’ve discussed this with — it is one translation of “knob” in the sense of “button”, but Apple’s naughty word detector clearly thought it was “knob” in the sense of the euphemism for a body part.
It didn’t stop the app passing review, but the automatic warning was still a regular part of updates for that particular app.
[0] Back in 2012
1. People who use them for that just come up with new ones all the time
2. People still use the word in the regular sense, like would I now have to come up with a euphemisms to describe a door knob ?!
3. How I wore is my decision. As long as I don't hurry anyone intentionally or knowingly no person and even less company had the (moralic) right to constraint me. (Through wrt. Minirs, Y least younger ones, their parents opinion matters, too.)
https://en.wikipedia.org/wiki/Alfred_A._Knopf
https://wireframe.raspberrypi.org/articles/banned-character-...
We see the same phenomena with encryption, authentication, and password security. People just tend to have a myopic understanding of the topic and vastly overestimate their ability in something they've barely even studied, which is the worst kind of ignorance. And they very much deserve to be named and shamed when they inevitably screw it up.
When dealing with names & addresses, consult an expert.
And phone numbers, and dates.
the scunthorpe problem will never die.
Deleted Comment
As a programmer, it's difficult how little of the world follows consistent "rules" that you need to build straightforward conditional logic.
Really, the only surprising thing is that Lyft didn't cross this bridge years ago.
I ended up taking my name off (the emails and website) of an open source project I maintain just to please the spam filters: https://twitter.com/j__st/status/1210559583934189568?s=19
I can appreciate the funny side of it, but it's also profoundly saddening somehow. I was born with this name, what do you expect me to do?
And who is bothered by this, really?
I find it hard to imagine that it's the drivers calling themselves Dick Cummings out of immaturity. So what does it really matter what the customers call themselves?
That was my thought too. It's one thing if you're using obscenity in an anonymous online alias, as a great many Redditors seem to love doing, but I'm struggling to believe that it's something people do in signing up for real-life ride-sharing apps, either as a driver or rider.
Obviously there must be some amount of people doing it, otherwise Lyft wouldn't bother with this at all, but it's just so... unexpected.
There's no data that this is true. It could simply be a project manager deciding that it's a good thing or the result of a tiny fraction of complaints.
Edit: Actually, it's worse bc if you get it wrong you open up your employer to getting called out for cultural insensitivity.
You should never try to define 'appropriate language' and then impose your definition of that on to the world. Especially if you want to do business.