It's fun but as I tested it I realized how this is pretty much the modern equivalent of a Facebook quiz that asks you the name of your first pet, first car and mother's maiden name.
If they recorded any of that they likely have enough to clone my voice somewhat faithfully.
Congratulations on labelling my French Canadian accent as French though, I'll have to work on my pronounciation more to fool the AI.
Does it regionalize at all? There are a few comments about Portugal Portuguese vs Brazil Portuguese so I think it only tries to find your first language, not actually pinpoint the underlying accent.
It would be nice if they were clear they wouldn't keep the sample on the page. They do have a privacy policy on their main site www.boldvoice.com/privacy
It didn't guess for me other than to say I was a native speaker.
Hi, founder here! We're a YC-funded education app with over 1 million downloads, currently #7 on the Education chart of the App Store. The goal of the accent oracle tool is to help more non-native English speakers find out about BoldVoice. While it's important as consumers to be aware of data privacy with AI tools in general, I want to set the record straight that that's not the point of what we're doing here.
But great to hear that we got your accent right :)
Hey! Just to be clear I really didn't want to imply that you were trying to clone my voice. I think your product is very interesting and this demo is a nice showcase of your tech.
It was meant more as a reflection on how, due to how fast threat models evolve, even the seemingly innocuous act of recording 2 sentences on a random website can now be used to break into my bank account.
I tried the app, and it's great. As a near-native French speaker I would be interested in working out my French pronunciation, hopefully you will go multi-language soon.
Did you know that something like 80% of the websites you visit are tracked by a handful of companies which are all effectively sharing that with many thousands of ad and data broker companies which can easily narrow you down? It's not tinfoil hat land to knkw that nearly every site you visit has cross-site tracking.
This was a big confidence booster for me as when I first started learning English, people would complement me on how well I spoke English, but I took that as my accent was still detectable. It's only been in the past 5 years that people assumed I was American and made no comment on my English at all, until I disclosed that English was my second language. It's usually certain words that give me trouble, like "cupboard" or "chef". The AI detected my accent as a mixture of German and English. When I tried to exaggerate my accent, it correctly detected Thai.
If you learned English after 16. You probably still have an accent. Native speakers are really, really, really good at detecting it. They probably know as soon as you say "Hi".
If you tune up your detector that high it’s very likely you’ll get false positives. I’ve met Americans with fairly strong “accents” that aren’t necessarily a dialect either, just a different way of speaking. It could also be a mix with parents who are non-native. American English is vast, and very heterogeneous.
If we’re talking about specific parts like a regional dialect then I would agree, those are tricky to acquire later, at least to those undetectable levels. They can be extremely specific.
Yep big difference in accents of my cousins who moved here when they were 9 and another when they were 18. Now they are in their mid forties and you can still tell who moved when based on their accent. Its impossible to change your accent after late teens.
there are people who are better than other people at blending in their accents, even in more difficult languages than English (for accent coding), perhaps they are just very good at that.
those words are your Shibboleths, words that give your origin away.
When I was in Germany, friendly people used to compliment me on my language skill saying "your German is good!". To which I would reciprocate: "thanks, yours too!"
My ex-wife whose native language is Spanish worked hard to eliminate her accent because she got tired of people calling her accent “cute.” Her shibboleths were anything with a schwa. The whole concept of schwas offends her sense of vocalic purity.
I knew my accent was strong, but I didn't expect to get 100% Portuguese, which is strange since Portuguese from Portugal sounds more like Eastern Europe, and Portuguese Brazil is more like Spanish. Maybe it considers both accents to be Portuguese?
A fun fact: When using Whisper by OpenAI, there seems to be a ~1% chance that all my text, which was spoken entirely in English, is automatically transcribed and translated into pt-BR without any prompting. It happens more often when I am not paying too much attention to pronunciation.
The weird thing is that all the words were transcribed correctly (beyond being entirely in a different language)
> Portuguese from Portugal sounds more like Eastern Europe, and Portuguese Brazil is more like Spanish.
Surely you mean the opposite? Portugal is literally next to Spain and both languages have coexisted since they were both born following Rome's fall. Both Galician and Portugal's Portugese are likely similar to each other and closer to Spanish than Brazil's Portuguese
You are right regarding Galician, but Galician isn't Spanish, rather original Portuguese, where some words changed throughout the centuries.
The way we both speak is rather different than the native Spanish speakers, that never have been exposed to native Portuguese/Galician speakers.
In fact, even native Catalan speakers have easier time understanding Galician/Portuguese speakers, than Spanish speakers do, probably due to the French roots in Galician/Portuguese carried by the crusades involved in the founding of both regions and naturally influeced the language evolution.
One cool fact is that Galician sounds closer to Brazilian Portuguese. It also has much of the same vocabulary (with the notable exception of the days of the week which it borrows from the other Romance languages while Portuguese has its boring days of the week)
Presumably each training speech sample is labelled with native language. For Portuguese there would be two distinct clusters: Portugal and Brazil. If your speech is in either cluster, it would just tell you that your native language is Portuguese without being any more specific. Sure, it's a missed opportunity but it doesn't distinguish Jamaicans from Australians either.
I presume there's enough difference between English spoken by Portuguese and English spoken by Russians for those also to be distinct clusters.
The homepage sort of implies that "having an accent" is something only non native speakers do? Like an accent only comes from your exotic mother tongue. Kinda weird. It told me I'm a native speaker, and I am a non American native speaker so... good I guess?
I don't follow. Why is the data they are getting from this better than the billions of hours of captioned voice data available from youtube/tiktok/instagram/whatever?
I don't mean to suggest that this particular toy is much use. I just mean you should not give random internet games standardised samples of your voice, for this reason.
It's a standardised sample, already correlated to text, close to the microphone, for one thing. You're just making it easier for them.
I mean I suppose you can use "like and subscribe", "without further ado", and "let's get started" as standardised samples if you want to catch a youtuber.
But AFAIK my voice isn't on the internet anywhere. Quite a lot of people are not.
There's a number of ways this information can be connected back, with varying precision, to the person who recorded it.
And we should have learned from the Cambridge Analytica scandal that data is used in ways we do not expect. For example, what if you don't care to reproduce someone's voice, but you do care to extract age/gender/racial background/sexual orientation from it?
Now all they need to do is somehow work out who you are from only your IP - no email, name, location or anything - then simply get a voice cloning model to work perfectly from this small sample, then either somehow hack all the other information needed to get into your bank account or chase down your family to get them to send them crypto and they've got you dead to rights. Simple as that, which is why I also never take phone calls, pay for anything with a credit card or go outside.
I have often wondered how much of someone's actual voice leaks through into their impersonations in a way that can be detected.
I just saw an incredible Facebook reel of a voice actor, Shelby Young, saying the same thing in a striking range of theatrical voices, and I still wonder. How much of her true vocal fingerprint is unavoidably there?
(As a fan of old movies, "Vintage" was particularly impressive to me -- she is impersonating not just voices but also the choice of tonality those actors made in light of recording technology)
You enter your First Name, Last Name, Gender, Date of Birth, Pet's Name and Mother's Maiden Name and press the button to find out what your Mr T Name is...
The app this is advertising helps non-native speakers with their accent, I assume to sound more American. This is a great goal, and I'm sure there are a lot of people who would be willing to pay the $200-$300 yearly subscription cost. Apparently the AI part is not even the main function of the app, that's what the extra $100 are paying for[1].
I would be interested in an AI-only product that would help me learn to passably immitate various English accents, like Australian, Irish and so forth, for fun. I know that ChatGPT Voice can do accents pretty well, I've been wondering if it would also be able to help me with mine, but I haven't tried it seriously.
I could absolutely see people be willing to pay for this. I am from the Midwest in the United States and I happened to be at an airport in some foreign country. Someone else heard me talking and they came up and asked me where I learned to speak English because it was so smooth. They were looking to get lessons to make their English better or at least more smooth. I thought their English was fine and they were a bit disappointed when I mentioned I was from the United States.
It's kind of annoying when services like this provide a free trial that you have to give a credit card number to even try, capitalizing on people forgetting to unsubscribe after trying.
Also, I'm very suspicious when a credit card form is on $site.com rather than $financial-institution.com
My mind is blown right now. My whole life I've been told that my speech is so American and that I don't have a Russian accent (left Russia when I was 4). Lo and behold, this app tells me that my accent is Russian (61%) with English being a distant second (13%).
I tried it and it said English 93% (left same age as you).
Then I did my best Russian accent and on the first time it gave me Hindi/Urdu at 80+%. I tried it a second time rolling my r's a bit more and it settled on Russian at 70%.
I think it's very sensitive to specific tells and I suspect the dataset for Russian accents may not account for all the variations in regional pronunciation and dialects.
I left Russia around the same age and got 100% English. I can easily do a fake accent and get Russian though. Also some other accents like German, French and so on are pretty easy to get too.
It gave this native English speaker "Swedish" with p > 90%. Just confirms the feeling I get every time I go to Sweden that they really do speak better English than me.
It thought my native language was Hindi/Urdu which was amused me if only because whenever I try to do a foreign accent it eventually morphs into a Hindi accent no matter where it started.
If they recorded any of that they likely have enough to clone my voice somewhat faithfully.
Congratulations on labelling my French Canadian accent as French though, I'll have to work on my pronounciation more to fool the AI.
It didn't guess for me other than to say I was a native speaker.
It was meant more as a reflection on how, due to how fast threat models evolve, even the seemingly innocuous act of recording 2 sentences on a random website can now be used to break into my bank account.
If we’re talking about specific parts like a regional dialect then I would agree, those are tricky to acquire later, at least to those undetectable levels. They can be extremely specific.
When I was in Germany, friendly people used to compliment me on my language skill saying "your German is good!". To which I would reciprocate: "thanks, yours too!"
A fun fact: When using Whisper by OpenAI, there seems to be a ~1% chance that all my text, which was spoken entirely in English, is automatically transcribed and translated into pt-BR without any prompting. It happens more often when I am not paying too much attention to pronunciation.
The weird thing is that all the words were transcribed correctly (beyond being entirely in a different language)
Surely you mean the opposite? Portugal is literally next to Spain and both languages have coexisted since they were both born following Rome's fall. Both Galician and Portugal's Portugese are likely similar to each other and closer to Spanish than Brazil's Portuguese
You are right regarding Galician, but Galician isn't Spanish, rather original Portuguese, where some words changed throughout the centuries.
The way we both speak is rather different than the native Spanish speakers, that never have been exposed to native Portuguese/Galician speakers.
In fact, even native Catalan speakers have easier time understanding Galician/Portuguese speakers, than Spanish speakers do, probably due to the French roots in Galician/Portuguese carried by the crusades involved in the founding of both regions and naturally influeced the language evolution.
Even my slavic gf was tricked for a few seconds and wasn't sure if it was some kind of eastern language she wasn't aware of.
And for the explanation https://www.youtube.com/watch?v=Pik2R46xobA
I've heard it described as if a drunk russian was trying to speak spanish.
Dead Comment
I presume there's enough difference between English spoken by Portuguese and English spoken by Russians for those also to be distinct clusters.
Don't do this.
Or, given similar concerns about image and video generation, our faces.
It's a standardised sample, already correlated to text, close to the microphone, for one thing. You're just making it easier for them.
I mean I suppose you can use "like and subscribe", "without further ado", and "let's get started" as standardised samples if you want to catch a youtuber.
But AFAIK my voice isn't on the internet anywhere. Quite a lot of people are not.
There's a number of ways this information can be connected back, with varying precision, to the person who recorded it.
And we should have learned from the Cambridge Analytica scandal that data is used in ways we do not expect. For example, what if you don't care to reproduce someone's voice, but you do care to extract age/gender/racial background/sexual orientation from it?
I just saw an incredible Facebook reel of a voice actor, Shelby Young, saying the same thing in a striking range of theatrical voices, and I still wonder. How much of her true vocal fingerprint is unavoidably there?
https://www.youtube.com/watch?v=DyNnyeu_kPU
(As a fan of old movies, "Vintage" was particularly impressive to me -- she is impersonating not just voices but also the choice of tonality those actors made in light of recording technology)
https://www.boldvoice.com
You enter your First Name, Last Name, Gender, Date of Birth, Pet's Name and Mother's Maiden Name and press the button to find out what your Mr T Name is...
... Mr T says your name is FOOL
"My. Voice. Is. My. Pass. Port... Verify. Me."
No. Its fine. go ahead and do this ( if you want).
I would be interested in an AI-only product that would help me learn to passably immitate various English accents, like Australian, Irish and so forth, for fun. I know that ChatGPT Voice can do accents pretty well, I've been wondering if it would also be able to help me with mine, but I haven't tried it seriously.
[1] https://www.boldvoice.com/frequently-asked-questions
Do people want to learn to speak English like a twangy guitar on purpose?
Also, I'm very suspicious when a credit card form is on $site.com rather than $financial-institution.com
Then I did my best Russian accent and on the first time it gave me Hindi/Urdu at 80+%. I tried it a second time rolling my r's a bit more and it settled on Russian at 70%.
I think it's very sensitive to specific tells and I suspect the dataset for Russian accents may not account for all the variations in regional pronunciation and dialects.
Dead Comment