Readit News
Posted by u/hbarka 5 months ago
Ask HN: Why is uptalk intonation so prevalent in ChatGPT voices?
I’ve tried asking it to set voice with an even tone and less of the annoying uptalk but lately it just continues in this way. It hurts to listen to.
slillibri · 5 months ago
They are still working on a realistic vocal fry?
mikrl · 5 months ago
When can I get the GPT that sounds like Boomhauer crossed with Gerald from Clarkson's Farm?
throwaway889900 · 5 months ago
Just gotta pipe it through a granular synth afterwards
AStonesThrow · 5 months ago
The new office mate’s prank will be to switch your AI’s voice to a Kardashian, Fran Drescher, Pauly Shore...

Plus I am sure that LLM engines could command a premium by shrewdly licensing such talent as James Earl Jones, Majel Barrett, or Milla Jovovich?

But it seems like the current trend is novel/generic voices in order to avoid suits/fees and pioneer new territory. Isn’t Siri’s personality a recognizable celebrity by now?

seydor · 5 months ago
they can use Sam's voice
ctrlp · 5 months ago
Why is it so prevalent in people generally?
rglover · 5 months ago
Because they think it makes them sound Smart? Because it's safer to fit in with the crowd than to Not? Because having a genuine personality is Difficult?
ctrlp · 5 months ago
I would assume the opposite. People use it to sound dumb, not smart, so as to sound non-threatening, but also so as to sound non-assertive. It's used to defuse potential conflict or perceived disagreeableness. Generally the affect of a late-stage conflict-averse institutionalism that punishes assertive or dominant behaviors. Uptalk is the "I'm showing my belly" of English affects.
_DeadFred_ · 5 months ago
The point of language is to have shared communication. That people adopt language standards isn't a moral failing, and accent is a legitimate part of it as it's more organic/instinctual to human interaction than grammar/diction classes.
alabastervlog · 5 months ago
As far as I can tell, it's an "I am not done talking yet" thing in many contexts. That's why people employing it usually drop the uptalk for their last sentence in a string of sentences (unless it's an actual question).

I reckon it took off because of telephones.

muzani · 5 months ago
It turns a sentence into a suggestion rather than a command.
chc4 · 5 months ago
nothing much, what's uptalk with you?
nativeit · 5 months ago
I'm having a rough couple of months, I'll be honest. [that should be read in the least sincere "my pleasure to serve you at the window" voice you can muster]
swyx · 5 months ago
Made in California. next question
hbarka · 5 months ago
There’s no shortage of YouTube videos aware of the uptalk or upspeak annoyances but I found this one from 1994. It seems to have spread from California Valleygirl-speak and then nationwide to college campuses. How does LLM training get influenced (weighted) by pop culture speaking styles?

https://youtu.be/z756L_CkakU

swyx · 5 months ago
pretrain data + rlhf
ryandrake · 5 months ago
I wish it were only a California thing. The Valley Girl uptalk/vocal fry thing seems to have spread across the country. Turn on the local news station in any region of the country and you'll hear it. Everyone is for some reason trying to sound like the Real Housewives Of Orange County.
alabastervlog · 5 months ago
NPR's even been full of it for more than a decade now. I think at some point (the '00s?) they really relaxed their elocution standards for hosts & reporters.

It definitely makes reporting feel trustworthy and serious? When almost every statement sounds like a tentative question?

TRiG_Ireland · 5 months ago
"Vocal fry" is not really a thing, as David Peterson explains. https://www.youtube.com/watch?v=qIJyEc07w2Q
saltcured · 5 months ago
More like trained by a certain generation and socioeconomic strata...

I've gotten old enough to now wonder if my dialect sounds like something from another world and era to younger folks in my region.

The way I felt about most of the Hollywood actors I heard from before technicolor was the norm.

orblivion · 5 months ago
In San Francisco I had a coworker originally from Italy who used upspeak while speaking in an Italian accent.
hbarka · 5 months ago
How did they manage that? The Italian accent is beautifully affirmative and confidently downspeak when making a statement.
carabiner · 5 months ago
Made in California? Next question?
barbazoo · 5 months ago
Made in California! Next question!
nottorp · 5 months ago
Oh c'mon. Californians are too expensive for training AI. Maybe it's from some Malaysian accent?
wongarsu · 5 months ago
Unless you train on twitch streamers. There seems to be an unspoken rule that any successful streamer has to move to LA. If you train on youtubers you get a surprising Mormon bias instead.

I would be surprised if the majority of the training data is licensed from the speakers.

muzani · 5 months ago
Malaysian here. Uptalk is used more to turn a sentence into a suggestion. Something like "Hey, there's one dim sum left," to suggest that I'm taking this but you can challenge it. I could see why ChatGPT would adopt it. It's trying to be polite.

Often it's in a tonal particle, "One dim sum left meh." But it's possible in trying to artificially combine tone and text, the uptalk is moved up.

But the telltale sign of a Malaysian accent is that it's clipped. Instead of "I don't like that idea," it becomes "Don't like it." ChatGPT may be written American, so as an accent, it would sound closer to, "I- don't like, that idea."

And sentences often end in an elongated manner, "I wrote that essay you wanted~". The elongated ends are quite common in many SEA accents as well, especially Thai.

Deleted Comment

breckinloggins · 5 months ago
This a highly problematic comment?

/s

uoaei · 5 months ago
What are you getting at? What's the joke?
ViktorRay · 5 months ago
I believe OpenAI wants ChatGPT to have a tone that is more casual and less professional or uptight than it was before...

And so ChatGPT relies on the training data to know what that means, which leads to it talking like this, as this is what the training data is filled with.

ergonaught · 5 months ago
Just telling it "Avoid upward inflection" or "Use a flat tone" prevents the lilt for me. Perhaps it doesn't "stick"? Perhaps it varies by voice.
hbarka · 5 months ago
It partially works but it doesn’t “stick”, as you say. I’ve tried setting it on Preferences but it isn’t consistent.
luluthefirst · 5 months ago
It generates more engagement than a monotonous tone.
psygn89 · 5 months ago
If only there was something in between the two.
muzani · 5 months ago
AI accents are incredibly good these days, especially ElevenLabs. ChatGPT is not a leader in this. I spent about $20 on this before just because I like the sound of its voice.
gtirloni · 5 months ago
I'm sure your intonation sounds annoying to some other generation too.