The strangest letter of the alphabet: The rise and fall of yogh

with lang_sounds as ( select lang, unnest(string_to_array(ipa, null) ) as sound from world_dictionary ), totals as ( select lang, count(sound) as sound_count from lang_sounds group by lang ) select lang, totals.sound, count(sound) / totals.sound_count from lang_sounds join totals on lang_sounds.lang = totals.lang where sound = 'θ' or sound = 'ð' or sound = 'θ̠' or sound = 'z' group by lang, sound order by count(sound) / totals.sound_count

I want to see English become mor fonetik and mor regular.

As I watched my son learn to speak, then later to read and write, I paid attention to his misspellings and they all made perfect sense if you approached the language phonetically.

The first sentence he wrote for me was:

“my daddy and i tocd on d woki toki” (“My Daddy and I talked on the walkie talkie.”)

At first confusing, but it omits all the silent, or effectively silent because native speakers usually omit the sound, or abbreviate it so much that non-native speakers would miss it if they didn’t know it was there. Like “l” in “talked”.

And don’t get me started on irregular conjugations.

efunnekol · 5 months ago

With the very wide variety of English accents around the world, you would have a tough time creating phonetic spelling in English that everyone would agree to. My mother (a true Brit) might want 'wahtah' for water, where I think it is perfect as is.

With that said, there are clearly some things that could be changed.

madcaptenor · 5 months ago

As a Philadelphian, that should be "wooder".

thebruce87m · 5 months ago

> a true Brit

There are a wide variety of accents in Britain alone - ‘wahtah’ isn’t recognisable to me for example. It’s a cliche to say brits don’t pronounce “r”, but all you have to do is watch Harry Potter to see all the different accents that do.

jlos · 5 months ago

You want ENGLISH to be more phonetic?

English, a language with over 40 different dialects in its country of origin.

English, the official language of over 60 countries?

English, the bastard child of millenia of Roman, Germanic, and French colonization?

English, a language with documented vowel shift that occured over 200-300 years?

THAT language would be easier if the words were spelled how they were pronounced?

makr17 · 5 months ago

"The problem with defending the purity of the English language is that English is about as pure as a cribhouse whore. We don’t just borrow words; on occasion, English has pursued other languages down alleyways to beat them unconscious and rifle their pockets for new vocabulary."

--James D. Nicoll

Pet_Ant · 5 months ago

I'm reminded of a meme that compared "car keys" with the word "khaki" in a 'Boston'-accent (non-rhotic).

Or the "pin=pen" merger of some accents, or "caught=cot". No one speaks "English", it's an ugly compromise of an army of dialects.

monknomo · 5 months ago

Which accent should the phonetic representation be based on?

Thick lousiana accent? Southeast london accent? Boston southie accent? Mid-atlantic?

No matter what you choose, it won't be phonetic for someone

thefringthing · 5 months ago

Kingsley Read went with something like an artificially rhotic midcentury RP when designing the Shavian alphabet. It works reasonably well for me, a Standard Canadian English speaker, except that distinguishing all the open vowels (father/bought/bot) can be a struggle,

efitz · 5 months ago

I believe that there is a lowest common denominator for accented English, and it sounds like “sung” English - which sounds like non-regional American English if the singer is singing and doesn’t naturally have a very pronounced accent. This is partially due to cultural effects but partially due to the mechanics of speaking with an accent.

By “lowest common denominator“, I mean omitting affectations like non-rotic R (Boston) and dropped g (urban) and exaggerated length of selected vowels (US southeast) and constant syllable length and other kinds of things. There are lots of videos on YouTube about how accents work.

But mainly phonetics is about eliminating silent letters, using the most unambiguous letter when there is a choice (eg K or S instead of C), etc.

Anyway I’m rambling but you probably get my point.

sorrythanks · 5 months ago

type in your own accent, it works when we talk

tracker1 · 5 months ago

The west-coast US/Canadian accent most likely as it's most prominent in popular media, and likely the largest English speaking accent group.

But good luck getting the UK or India to adopt it.

ericmay · 5 months ago

> I want to see English become mor fonetik and mor regular

I don’t. The complexity and history is what makes life fun, and makes English one of the most interesting languages read, written, and spoken today.

Similar to Imperial versus Metric. Yes the metric system is better (aside from temperature ranges - Fahrenheit is superior) but it’s not nearly as fun. Why 12 inches in a foot? Who knows!

efitz · 5 months ago

I don’t think it’s etymology and irregularity that make English “fun”; I think that it’s English’s large number of concepts because we freely appropriate words and their ideas from other languages (“schadenfreude”, “Deja vu”, etc), the small alphabet, the lack of diacritics, and the flexibility that has been forced onto it primarily by American speakers abusing grammar- verbification, etc.

I think that simplified English is the most likely candidate for a universal human language. The “Unicode of languages”, if you will.

cenamus · 5 months ago

The writing system has 0 to do with that. More the UK and USA being superpowers for ages.

There is no reason why English would be more interesting to native speakers than German, Spanish, Finnish or Bulgarian to their respective native speakers.

fouronnes3 · 5 months ago

> Yes the metric system is better

You're changing the definition of "better" so you can have your cake (admit X is better than Y) and eat it too (prefer Y over X). It's ok to like fun things! It's ok to like Imperial and weird grammar, but then you can't also claim that you agree it's worse.

vinajuliette · 5 months ago

Most imperial units are how they are because they can be easily divided into a bunch of different ways - 10 can only be divided by 5 and 2, 12 can be divided by 2, 3, 4, and 6. This is practical when you have a physical amount something you need to deal with, like re-portioning a recipe.

spookie · 5 months ago

How is Farenheit superior? If I may ask.

bobsmooth · 5 months ago

>Why 12 inches in a foot?

Lots of divisible factors.

HPsquared · 5 months ago

Spelling preserves a lot of meaning and etymology in words. Phonetic spelling would remove some of this richness from the written language. Logographic systems like Chinese take this to the extreme. I think English strikes a nice balance.

asveikau · 5 months ago

Spelling can also bridge the gap between phonetic differences between accents. Many variants of English don't even have the same number of sounds as others, but we all "come together" when we write.

In English, it would frustrate me if every time I read marry it would be spelled the same as merry, because I say those differently, even if most Americans I come in contact with daily do not.

Spanish spelling is much more phonetic than English, but one thing I admire about it is when things are correctly spelled, you can adjust the text for accent based on spelling. The only area I can think of where spelling does not denote contrast in any accent is between b and v. (Edit: and that h is silent.) There are some contrasts that are only kept by a minority of speakers (Ll is merged with Y for most people, but maybe not in Bolivia or parts of Spain; s and z are merged in Latin America), but it's still a meaningful contrast for somebody.

r_lee · 5 months ago

Finnish is basically 100% phonetic. A raw Finnish English sounds pretty funny but it's basically pronouncing every character as it's written

yboris · 5 months ago

Plug for the Shavian alphabet - https://en.wikipedia.org/wiki/Shavian_alphabet

"a constructed alphabet conceived as a way to provide simple, phonemic orthography for the English language to replace the inefficiencies and difficulties of conventional spelling"

nielsbot · 5 months ago

Couldn't you use a subset of IPA for this? (Not a language person, so I don't know)

madcaptenor · 5 months ago

My favorite example of a phonetic misspelling - when my older daughter was learning to write, we were potty-training my younger daughter. The older one made a sign "PODE CHRAN FOR [younger daughter's name]" and taped it to the wall over the potty.

tim333 · 5 months ago

Maybe we could have a separate version that was fonetik and regular and the kids could learn it? They could always use google translate to deal with texts in normal English like I do with most languages.

Edit - I looked up and there is a translation thing online.

"this is text translated to a phonetic version" goes to "ðɪs ɪz tɛkst trænzˈleɪtɪd tuː ə fəʊˈnɛtɪk ˈvɜːʒᵊn" in British pronunciation and "ðɪs ɪz tɛkst trænˈsleɪtɪd tu ə fəˈnɛtɪk ˈvɜrʒən" in American.

Not sure it's going to catch on.

bradrn · 5 months ago

> "this is text translated to a phonetic version" goes to "ðɪs ɪz tɛkst trænzˈleɪtɪd tuː ə fəʊˈnɛtɪk ˈvɜːʒᵊn" in British pronunciation and "ðɪs ɪz tɛkst trænˈsleɪtɪd tu ə fəˈnɛtɪk ˈvɜrʒən" in American.

These samples are using the International Phonetic Alphabet, which isn’t really designed for the purpose of being a practical orthography — it’s used as a language-independent transcription method which can represent any sound. (That’s also why the British and American versions are different: different accents use different sounds, so they’re transcribed differently.) So this isn’t really a ‘phonetic spelling of English’, though it’s easy to see where the confusion comes from.

WalterBright · 5 months ago

Learning to read phonetically still pays off handsomely, despite the erratic spellings of English.

I sometimes discover that I mispronounce words I know very well. The reason is that I never heard those words used audibly, I only encountered them in books. So I "invented" a phonetic sound to them, and long ago forgot that I did that. Until whammy, I encounter it verbally, decades later.

xanderlewis · 5 months ago

The impressive part is that he said '...and I' rather than 'me and...'!

Dwedit · 5 months ago

https://guidetogrammar.org/grammar/twain.htm

(Not actually by Mark Twain)

SketchySeaBeast · 5 months ago

What about homophones that are clearly different when written? I suppose that would be balanced out by the heteronyms.

taneq · 5 months ago

"Ze drem vil finali kum tru."

(What was that essay/joke called again?)

dwayne_dibley · 5 months ago

ah, 'newspeak'

"English spelling has a reputation. And it’s not a good one." - never have i ever agreed with anything more

different hill, but one I would die on is: as the letter "c" should make the "ch" sound, the letter "c" serves no purpose not already handled by "s" or "k" otherwise

murderfs · 5 months ago

https://guidetogrammar.org/grammar/twain.htm

  For example, in Year 1 that useless letter "c" would be dropped to be replased either by "k" or "s", and likewise "x" would no longer be part of the alphabet.

  The only kase in which "c" would be retained would be the "ch" formation, which will be dealt with later.

  Year 2 might reform "w" spelling, so that "which" and "one" would take the same konsonant, wile Year 3 might well abolish "y" replasing it with "i" and iear 4 might fiks the "g/j" anomali wonse and for all.

  Jenerally, then, the improvement would kontinue iear bai iear with iear 5 doing awai with useless double konsonants, and iears 6-12 or so modifaiing vowlz and the rimeining voist and unvoist konsonants.

  Bai iear 15 or sou, it wud fainali bi posibl tu meik ius ov thi ridandant letez "c", "y" and "x" -- bai now jast a memori in the maindz ov ould doderez -- tu riplais "ch", "sh", and "th" rispektivli.

  Fainali, xen, aafte sam 20 iers ov orxogrefkl riform, wi wud hev a lojikl, kohirnt speling in ius xrewawt xe Ingliy-spiking werld.

SamBam · 5 months ago

> fiks the "g/j" anomali wonse and for all

But we'd still be arguing about how to pronounce "ᵹif"

the_lucifer · 5 months ago

I remember a version which ends with how we'll end up speaking German.

pmcarlton · 5 months ago

The nice thing about this passage is it reflects the extent of Twain's non-rhotic dialect -- he keeps the R in "year"/"years", "orthographical", and "world" but drops it in "after", "letters", and "dodderers". So only dropped in final unstressed syllables of multi-syllable words.

mixmastamyk · 5 months ago

Recommend X for the ‘sh’ sound, as it is pronounced that way in languages like Portuguese. Y is a common typographical substitute for theta/thorn, as in “ye olde shoppe.”

tim333 · 5 months ago

By the way the source was a Mr Shield's letter to the Economist rather than Twain https://web.archive.org/web/20200311221105/http://www.letter...

bmacho · 5 months ago

I'm convinced that this is Just The Right Thing To Do. Like ridiculously strong benefits, and practically no drawbacks at all.

Dead Comment

kevin_thibedeau · 5 months ago

English's spelling irregularities help with disambiguating homophones:

  cent / sent / scent
  ceiling / sealing
  cite / sight / site
  colonel / kernel
  carrot / karat
  cue / queue

jleyank · 5 months ago

Which, of course, does not help things like polish polish (made in Warsaw) and to produce produce (pull apples out of a bag). However you look at it, when they set up English words and spelling there was large quantities of alcohol involved.

jraph · 5 months ago

Only in writing. The disambiguation is already needed when spoken and the context does this.

hajile · 5 months ago

If you look up these words in the dictionary, the same word with the same spelling very often has several different definitions that are often unrelated because homographs (same spelling, but different meaning) are super-common in English. Dictionaries don't account for newer or more niche meanings of words either.

How is it that you can say these words without confusion?

Language is context sensitive and you understand the word based on the context around it. Likewise, you understand homographs based on the context. Because of this, spelling isn't as important as it might appear.

its-kostya · 5 months ago

On paper, yes. But not when someone speaks. If you used a homophone while speaking, the listener would be able to distinguish which variant the talker intended based on context. I would argue this is enough of a reason for written text as well.

int_19h · 5 months ago

Some other languages do the same with diacritics.

Most don't bother because context is nearly always sufficient.

normie3000 · 5 months ago

And cause confusion with needless heterographs?

practice / practise licence / license

jmyeet · 5 months ago

"Ch" is a strange hill to die on. "Ch" has a mostly consistent pronunciation (eg chair, touch, chain, choke, recharge, etc) that no other letter combination does.

Exceptions to this are generally loan words, particularly from French (eg chaise, which sounds more like "sh"). Others are harder to explain. "Lichen" springs to mind. Yes it technically comes from Latin but we're beyond the time range to truly consider it a loan word.

There are also some "ch" words of Greek origin (IIRC) that could simply be replaced with "c" or "k" (eg chemistry, school).

"Kh" on the other hand I think is entirely loan words, particularly from Arabic. Even then we have names like "Achmed" that would more consistently be written as "Akhmen". "Khan" is obviously a loan word but I think time has largely reduced the pronunciation to "karn" rather than "kharn" if it ever was that.

But I can't think of a single "kh" word that pronounced like "ch" in "chair".

"Sh" doesn't seem to crossover with any of these pronunciations.

jacquesm · 5 months ago

In Dutch and German Ch is pronounced as 'g'.

arkensaw · 5 months ago

> It’s full of silent letters, as in numb, knee, and honour. A given sound can be spelled in multiple ways (farm, laugh, photo), and many letters make multiple sounds (get, gist, mirage).

that last one is hardly fair - gist and mirage are french words. might as well complain about the silent letters in rendezvous or faux pas.

pessimizer · 5 months ago

Almost every English word is French, except for the most important ones.

ochrist · 5 months ago

In Danish knee is 'knæ' and the K is pronounced very clearly. It's interesting that English speaking people have forgotten how to pronounce K before N, so the Danish king Knud became Canute.

user982 · 5 months ago

But which "ch" sound? "Ch" as in "church" is just "tsh". "Ch" as in "charade" is just "sh".

cwnyth · 5 months ago

Seconding this. C should be the ʃ sound, and then TC should be the "ch" in "church." The fact that there's no one letter for ʃ is the real tragedy.

o11c · 5 months ago

I've played around with respelling quite a bit; one of the most difficult adaptations is forcing yourself to correctly use "dh" (few-but-common words, "thy", "either", "teethe") vs "th" (most words, "thigh", "ether", "teeth").

j -> dzh is more weird than anything.

Vowels, of course, are a cause of war between dialects; nobody can even agree how many there are.

int_19h · 5 months ago

I kinda wish English avoided Xh type digraphs because they screw up common borrowings like Thai. Sure, that's not strictly phonemic in English, but I think realistically given how readily English adopts foreign words into it without completely nativizing them phonemically, any orthography should strive to reflect that, meaning that combinations like "th" should have their obvious meanings that can be inferred by native speakers even if such a sequence never occurs in native words.

Esperanto has a nice trick where they reserve "x" as a modifier letter, so if you can't use diacritics you write "cx", "sx", "jx" etc; but it does not have a sound value of its own and can never occur by itself. We could extend this to "tx" and "dx" with obvious values, and also to vowels - "a" for /æ/ vs "ax" for /ɑ/, "i" for "ɪ" vs "ix" for /i/ etc. Using "j" the way it is today feels somewhat wasteful given how rare it is. In the x-system it would probably be best represented by "gx", and then we could have a saner use for "j" like all other Germanic languages do. Which would free up "y" so we could use it for the schwa.

One thing that occurred to me the other day is that "x" is also a diacritic, so we could just say that e.g. "sx" and "s͓" are the same thing. Then again from a purely utilitarian perspective a regular dot serves just fine and looks neater (and would be a nice homage to Old English even if ċ and ġ are really just a modern convention).

Vowels, yeah... I think it's pretty much impossible to do a true phonemic orthography for English vowels that is not dialect-specific. As in, either some dialects will have homographs that are not homonyms, or else other dialects will not have the ability to "write it as you speak it" because they'll need to use different letters for the same (to them) sound. In the latter case, it would become more of a morphological orthography. Which would still be a massive improvement if it's at least consistent.

OTOH if you look at General American specifically, and treat [ə] and [ʌ] as stress-dependent allophones, then you can get away with 9 vowel characters in total (ɪiʊuɛəoæɑ). That's pretty easy with diacritics.

emmelaich · 5 months ago

Bring back þ!

zcdziura · 5 months ago

I completely agree with you. I've taken an amateurish interest in linguistics over the past couple of years, and I've often thought that it might be a fun exercise to come up with a phonetic alphabet for the English language. Use the letter 'c' to represent /ch/, 'x' to represent /sh/, etc.

Maybe as a fun pet project someday!

shemtay · 5 months ago

Words of Latin origin are identifiable at a glance, and homonymic collisions are thereby avoided

-- Caeser, seizer of the day

user982 · 5 months ago

"Caesar" was pronounced in Latin with a hard C, which is preserved in German ("Kaiser").

tbrake · 5 months ago

Changing "cube" to "kube" would just look like it's pronounced "koob" (e.g. rube, tube, lube), so we swap a minor spelling aggravation for a minor pronunciation edge case. unless you want to go full kyube but we're not putting that on the table.

cvoss · 5 months ago

Well, it would be a step backward in the right direction to go with spelling it 'kube' and pronouncing it 'koob'. That would hew to the original Greek. We'd also bring cybernetic back closer to kubernetes. And circle to kuklos. (Side note: It's another spelling "error" that we use 'y' in English to transliterate the Greek upsilon, which looks like 'Y' when capitalized, but is really a better match to 'u'. Hence, hyper and hypo instead of huper and hupo (like super and sub).)

nicoburns · 5 months ago

kyube or kyoob would definitely be the way to go.

It's funny you use "tube" as an example though, as in my British accent I pronounce that as "chube", whereas I believe many Americans would use a "t" sound for that word. Not sure how you settle on a spelling in those cases.

int_19h · 5 months ago

Why would it? "u" generally doesn't follow this pattern in English after "k" any more so than it does after "c".

That aside, what you describe is a distinction between yod-dropping and lack thereof, and whether and where it happens is highly dialect dependent.

hajile · 5 months ago

This is an issue because vowel letters/digraphs are much more inconsistent than consonant letters/digraphs.

Swizec · 5 months ago

> as the letter "c" should make the "ch" sound

What’s the ch sound? My intuition from German class is that ch represents a throaty hhhh. Somehow that got spoiled into k in most English words.

Every c in Pacific Ocean is pronounced differently. C is a silly letter.

1718627440 · 5 months ago

> My intuition from German class is that ch represents a throaty hhhh

If you mean the standard German from Germany, there a two variants. At the end of a syllable it is like you described (kind of throaty hhhh). For the beginning of syllables think of sh and open your mouth.

microtherion · 5 months ago

> My intuition from German class is that ch represents a throaty hhhh

It varies between dialects. Swiss German speakers tend to stick out to Germans because we pronounce the ch in a much scratchier way than is accepted in Standard German.