But when asked for the countries in the European Union, it'll only list counties around the bridge. It then realizes it has failed, tries again, and fails again, hard. Over and over. It's very lucid and can clearly still evaluate that it's going off the rails and what it's doing wrong, but it just can't help itself, like an addict. I really don't like anthropomorphizing LLMs, but it was honestly hard to watch how much it was struggling in some instances.
Anyways I also don't enjoy anthropomorphizing language models, but hey, you went there first :)
Basically, LLMs are “blind” to text. Fragments of text are converted into token IDs drawn from something like a big enum of possible tokens, and those IDs are all the model ever works with.
They can’t see spellings; they cannot see the letters.
So, they can’t handle things like “how many letters are in Mississippi?” reliably.
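To make that concrete, here's a minimal sketch using the tiktoken library. The choice of encoding ("cl100k_base") and the exact splits and IDs it produces are just illustrative, not tied to any particular chatbot:

```python
# Rough sketch of why letter counting is hard for an LLM.
# Uses the tiktoken library; the encoding and the exact token splits/IDs
# shown in the comments are illustrative, not specific to any one model.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

word = "Mississippi"
token_ids = enc.encode(word)
pieces = [enc.decode_single_token_bytes(t) for t in token_ids]

# The model only "sees" the integer IDs, not the letters inside each piece.
print(token_ids)  # a short list of integers
print(pieces)     # e.g. chunks like b'Miss', b'iss', b'ippi'; the letters are hidden inside opaque chunks
print(len(word))  # 11 characters, but nothing in the IDs directly encodes that count
```

The word comes through as a handful of opaque chunks, so "count the letters" has to be inferred indirectly rather than read off the input.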
Because chatbots run with nonzero temperature, they will sometimes emit the right answer just because the dice rolled in their favor. So if you go try this and get a good answer, that's not conclusive either.
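If "nonzero temperature" is unfamiliar, here's a minimal sketch of temperature sampling over a toy next-token distribution. The logits are made up; the point is only that with temperature above zero the model samples instead of always picking the single most likely token:

```python
# Minimal sketch of temperature sampling. The logits are invented; the point
# is that with temperature > 0 the output is a dice roll over a distribution,
# so the same prompt can produce different answers on different runs.
import math
import random

def sample_with_temperature(logits, temperature, rng):
    if temperature == 0:
        # Greedy decoding: always the arg-max token, fully deterministic.
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [l / temperature for l in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    r = rng.random()
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if r < cum:
            return i
    return len(probs) - 1

# Toy next-token choices: index 0 stands for the correct answer, index 1 for a wrong one.
logits = [2.0, 1.5]
picks = [sample_with_temperature(logits, temperature=0.8, rng=random.Random(s)) for s in range(10)]
print(picks)  # a mix of 0s and 1s; the same prompt can yield either answer
```

That's why one lucky correct reply tells you very little about whether the model can actually do the task.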
That’s the thing we’re dealing with, that’s how it works, that’s what it is.