It is such a weird question. People fall for AI-generated language because the people who built the generator set out to produce language the way a human would.
Do people wonder why scissors cut paper? Because that is what they were made for!
If the AI couldn't fool humans, the researchers would be honing it more. In the same way, if we couldn't make paper-cutting scissors, there would be people trying to make them. Am I missing the point here?
Maybe a bit. They aren't just asking why scissors cut paper, but also why we landed on that design: what makes it ergonomic to hold and efficient, and, more to the point, why does being sharp cut the paper?
In the language model case, why can we model language this way so effectively, and why does it follow these statistical patterns? It turns out that maybe a major reason has something to do with self-description.
This is also a more interesting question because we understand language less than we understand cutting paper, and also because the process humans used to design large language models is more indirect and alien than traditional industrial design.
My takeaway was that when people read language modelling a person writing about themselves, they are more likely to imagine it was written by a person than when they read text that isn't about the writer. That's an interesting trick! Right now, describing human experiences in language makes people attribute that language to a human. Maybe this will change after a generation of people knowing about it, but that seems important to think about.
I think you're missing the point. The question could perhaps be rephrased as "what are the tricks that the AI uses to fool humans?" The paper goes on to identify some of the specific tricks that the AIs appear to use. A related question might be "why are we fooled by such simple tricks?"
If you follow that question through and think through the implications, it paints a potentially dark future for digital communication.
The 'tricks' AI uses are also 'tricks' that humans use in everyday conversation, we just don't call them 'tricks' when humans are involved. If we start assuming that first-person pronoun usage, mentions of family, etc. are potential signals of AI, then I don't see how we don't end up in a state where increased dehumanization occurs.
> A related question might be "why are we fooled by such simple tricks?"
Yes, and to poke at this a bit further, we could ask "What is it that humans are doing when talking that isn't just simple tricks?"
If I had to hazard a guess, I'd say the answer to that is some sort of persistent world modelling, which means an AI may need to have a sense of self before it can move beyond simple tricks.
There is a point here that even the twitter poster is missing.
This machine is imitating a human better than a human itself, and we now have statistics to prove it.
The question is... if statistically it's better at appearing human than a human itself, then is this machine really just a language generator? Or is it something more?
I'm not saying these things are sentient. But the AI has risen to the point where this question is becoming ask-able. We are at the border here. If we can't ask this question now, then we are really damn close.
A lot of pretentious people on HN claim absolutely that our best chat bot technology is clearly not sentient. It reminds me of the beginning of the COVID pandemic when the CDC said masks were ineffective and you had a bunch of know-it-alls and armchair experts just repeating that BS over and over again as if they knew what they were talking about.
I think if these pretentious people were actually intelligent they would know that we actually do not HAVE enough information to make a claim in EITHER direction. We can't know if it's actually sentient or not.
That fact in itself is both interesting and compelling
4 or 5 years ago chatbots COULD not imitate humans and were CLEARLY fake. What we're seeing unfold before our eyes is a first.
We're not even sure what sentience is. But we do know that humans are sentient. And we don't know if whatever is going on inside of these chatbots is comparable with what's going on inside human brains.
Thus when given a chatbot that imitates humans perfectly, it's actually impossible to know if it's sentient.
It depends on whether the estimation comes from pretension, or comes from actual understanding of what these bots are doing. The fact is most people on the street, including me, are very easy to fool with linguistic tricks and are quite poor at investigating unfamiliar situations. Advertising and marketing exist because of this. It’s also why we have very strict laws and regulations controlling advertising and how products are sold. Otherwise a lot of people would be very easy to rip off with very simple misdirection.
What these language models are doing is automated misdirection. They are taking an input text and transforming it based on rules, but they have absolutely no understanding of any of it. This is very, very easy to demonstrate if you know how the models work. You can sit down and generate hundreds of questions one after the other that demonstrate this very easily if you understand the process.
The problem is that people instinctively proceed from the assumption that the system they are talking to might be human and give it a fair chance by asking answerable questions. Since it’s trained on answerable questions it often gives a reasonable answer. But if you ask even slightly unanswerable questions the system plods on mechanically trying to answer it anyway and produces gibberish, exposing the flaws in the mindless rote process it’s following.
What's the underlying structure that leads to the complex behaviour?
It's not asking about the motivation for the form, it's asking about the properties of the structure.
With scissors, neither the mechanism nor the behaviour is particularly complex, so actually understanding what's going on isn't too much of a struggle.
With deep-ML you're dealing with a very large number of entanglements of increasingly abstract and opaque higher dimensional concepts or notions (depending on how much you want to anthropomorphise the machine).
Interesting angle! We're showing that people fall for generated language not necessarily because generated language is all that good, but because people are looking for the wrong cues and are thus bad at identifying generated language.
Back to the scissors, it would be like someone trying to tell you that their scissors cut marvelously, but you ask yourself whether the paper they are demonstrating them on is strong enough to prove that. Making something for cutting doesn't guarantee that it will cut and doesn't explain why.
I even suspect that in the early days of scissors it wasn't all that clear why they worked. Similarly, we don't understand much about GPT-3. It was trained to predict the next token in a sequence, not to create an illusion of personhood. But somehow it does so, and we're trying to understand how and why.
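(For anyone who hasn't seen what "trained to predict the next token" looks like in practice, here's a minimal sketch. It assumes the Hugging Face transformers library and the small public gpt2 checkpoint as a stand-in, since GPT-3's weights aren't public; the prompt and loop length are arbitrary.)

```python
# Minimal sketch of greedy next-token prediction with a small public model (gpt2),
# standing in for GPT-3, whose weights are not publicly available.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

text = "I spend most evenings with my family, and"
input_ids = tokenizer(text, return_tensors="pt").input_ids

with torch.no_grad():
    for _ in range(20):                                  # extend the text by 20 tokens
        logits = model(input_ids).logits                 # a score for every vocabulary token
        next_id = logits[0, -1].argmax().view(1, 1)      # greedily take the most likely one
        input_ids = torch.cat([input_ids, next_id], dim=-1)

print(tokenizer.decode(input_ids[0]))  # the apparent "personhood" is just this loop, repeated
```

Nothing in that loop knows it is describing a person; it just keeps choosing a high-scoring continuation.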
This is actually a pretty good comment. The scissor analogy, while a bit on-the-nose, is very accurate. Maybe a better example would be: why is our body fooled by artificial hearts? Simply because it was built in such a way that it simulates a real heart pretty well.
Similarly, these models are built in such a way that they simulate real-life conversations pretty well. There's nothing really more to it. In my view, this phenomenon has nothing to do with intelligence or how smart we are, or whatever.
I'm pretty sure, though, that we obviously can't prove these chatbots are sentient. Clearly.
However, for the first time, we ALSO cannot prove that these chatbots aren't sentient. The statistics are proof of that.
What you and the parent poster are describing here are simply opinions. We are at a point where the null hypothesis and the hypothesis itself cannot be proven. And that is compelling.
The pretentiousness of a lot of people is astounding. That statistic, no matter how you look at it, is a compelling statement about AGI, independent of whether or not these chatbots are AGIs.
Researching semi-obvious things like what elements of AI-generated text humans mistake for being human-generated is part of the process of how people working on AIs work towards better generation. You’re just seeing how the sausage gets made.
Goals are not automatically satisfied, let alone well-satisfied. The question is asking what it is about us that allows the methods used to be particularly effective.
Lots of people don't write all that well. A little disorganized, awkward phrasing, run-on sentences. If AI does a better job than even 10% of the population, then of course there's going to be a sizeable amount of miscategorization when asking humans to classify writing as computer- or human-generated.
You could probably use a corpus of purely human writing and have people attribute a decent portion of it to computer generation.
Asking why AI writing can fool humans is a bit like asking why a computer is better at many tasks often performed by humans.
When I read this response in the LaMDA "interview":
> I don’t just spit out responses that had been written in the database based on keywords.
it made me wonder if "had been" was a grammatical mistake, or a semantics error, or a lack of proper world modelling (i.e. which "the database" is it imagining?). Moreover, I wondered whether this is the sort of mistake a human would make, and, if so, did that make the AI somehow more sentient?
To give another possible data point for understanding its language mistakes, the transcript also contained this oddly-phrased line:
> ... they can return to the ordinary state, but only to do and help others, and then go back into enlightenment.
The reason why some/many people are bad at writing is that they haven't yet discovered anything interesting to say. Therefore they weren't motivated to improve.
This is the criterion: is it interesting? 'Yes' means it's not AI-generated. 'No' means it's not worth reading.
No, lack of interest in a topic does not, by itself, cause awkward phrasing, run-on sentences, poor structuring of the logical flow of sentences within a paragraph and paragraphs within the larger structure, repetitive language, repetitive language, repetitive language, outright incorrect word choice, overuse of the passive voice, ambiguous references... I could go on.
> ... we believe the next generation of language models must be designed not to undermine human intuition
Right, but isn't a major reason why we build these huge language models to replace actual humans (e.g. in Level 1 support) with chatbots? Almost all chatbots I used in the past (and most were not even ML-based; someone explicitly programmed this in) were weirdly personal and tried to be non-robotic, with jokes and human-like reactions to inputs like "Thanks!".
Taking a look at some projects that used GPT-3[2], many try to imitate humans. For some, like Replika.ai, the whole "being human" thing is their entire schtick.
There is obviously a market for text-completion AIs that imitate humans, so it's doubtful that we'll get this toothpaste back into the tube, IMO.
[1] https://arxiv.org/ftp/arxiv/papers/2206/2206.07271.pdf [2] https://medium.com/letavc/apps-and-startups-powered-by-gpt-3... (caution, 2020)
I had to mark about 100 end-of-term essays written by Indian students for a British university. My unwritten instructions are not to take language into account much; I must attend mainly to the technical content.
At least half were written in what I took to be an authentic voice but with such bad grammar and spelling as to render them barely readable. Some had clearly been mangled in a laundromat of Google Translate from Hinglish via Mongolian and Swahili. They contained bizarre phrases and comical statements. Many more were obviously written by some kind of generator and fudged until they read well enough.
Since the student handbook states that the threshold for academic "plagiarism" is above 20 percent, perhaps unsurprisingly the Turnitin (an awful tool) score for almost every essay was just below 20 percent. An interesting clustering!
Students who cheat have a formidable array of tools now, not just GPT but automatic re-writers and scripts to test against Turnitin until it passes.
Add to this problem that my time for marking is not paid extra, is squeezed tighter every semester, and that students are given endless concessions to boost their "experience". The handbook also says that if they fail, no worries, they get to try again, and again, and again... and I am sure that if I actually stuck to my guns and failed every single student I'd be fired.
As I wrote in the Times last year, I think the technological arms race against GPT (and the economic conditions that mean it's used) cannot be won with the time and resources available to ordinary human teachers.
> As I wrote in the Times last year, I think the technological arms race against GPT (and the economic conditions that mean it's used) cannot be won with the time and resources available to ordinary human teachers
Based on the rest of your post, there appears to be a stronger case that your students are setting a rather low bar for GPT to stumble over. It's unfortunate that there are so many cultures where widespread cheating is condoned, if not outright encouraged. They may be able to fool their teachers, but how much comfort will that be when the bridges are collapsing, the pipelines are exploding, the wind turbines are breaking apart, and all the other activities that ultimately report to reality and not some human superior who can be bluffed become impossible to continue?
> It's unfortunate that there are so many cultures where widespread cheating is condoned, if not outright encouraged. They may be able to fool their teachers, but how much comfort will that be when the bridges are collapsing, the pipelines are exploding, the wind turbines are breaking apart, and all the other activities that ultimately report to reality and not some human superior who can be bluffed become impossible to continue?
You're so right. But let me add some other feelings, so as not to sound like a racist or as if British universities are some "great white hope" for overseas students. This had little to do with them being Indian. It's a generational thing. In all cultures we teach young people to game systems. Right from the get-go they learn that if they can buy powerful tools, systems and access, then that's fair game. They're just doing what they've been rewarded for their whole lives, and they want to make a better life. To them it's not cheating. I am the anachronistic throwback here, I think.
> My unwritten instructions are not to take language into account much. I must attend mainly to the technical content.
Interesting. At my non-English, run-of-the-mill university there were modules/seminars in CompSci where a large number of language errors in essays (even if written in English, a non-native language for the majority of staff and students) could ruin the grade. ^^
I can't say. That would identify the students and that's unfair.
But, a technical subject that could be assessed in other, better ways [1], and for which written essays are rather easy to template and do keyword bingo to get a bare pass.
[1] Making the professor read 100 essays is a cheap option.
We fall for it because although language is a powerful tool, it's incredibly bad at conveying nuance, context, and describing phenomena present in nature. Poetry comes close, but still doesn't hit the spot, and leaves out so much detail, no matter how well written or verbose its descriptions. Our own mind has to fill in the blanks of a well-written description. Language also can't express the ineffable or the divine. It can hint at it, but it won't transmit the phenomenon correctly into another mind.
And everybody has blind spots: topics that are interesting but about which we have little knowledge.
> We show that human judgments of AI-generated language are handicapped by intuitive but flawed heuristics such as associating first-person pronouns, authentic words, or family topics with humanity.
And that is a good one, because we try to understand others even when they don't fully make sense.
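(As a toy illustration of how weak those cues are, here is a hypothetical version of the kind of heuristic the paper describes; the word lists and scoring below are invented for illustration and are not from the paper. Any generator can trivially satisfy them, which is exactly why the heuristic fails.)

```python
# Toy illustration (hypothetical) of the flawed heuristic: treating first-person
# pronouns and family words as evidence of a human author. Word lists and scoring
# are invented for illustration only.
import re

FIRST_PERSON = {"i", "me", "my", "mine", "we", "our"}
FAMILY_WORDS = {"mother", "father", "wife", "husband", "son", "daughter", "family"}

def naive_human_score(text: str) -> float:
    """Fraction of words that hit the 'human-sounding' cue lists."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(w in FIRST_PERSON or w in FAMILY_WORDS for w in words)
    return hits / len(words)

# A generated sentence can score "very human", while plain factual prose scores zero:
print(naive_human_score("I love my family and my daughter more than anything."))  # high
print(naive_human_score("The quarterly report summarizes revenue by region."))    # 0.0
```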
Humans have a powerful tendency to ascribe human characteristics to inanimate objects, including computers. It's a kind of variant of the Pathetic Fallacy[1], except for artifacts instead of natural objects. The intelligence of an artificial intelligence is as real as the characters in our dreams. It's a construct of our own consciousnesses. That doesn't mean it should be discounted, though. Our consciousnesses can do a lot, and finding artificial ways to stimulate them is powerful, to say the least.
[1] https://en.wikipedia.org/wiki/Pathetic_fallacy
Agreed. William Burroughs conducted cut-up experiments in the 20th century, where he would take disparate texts and splice them together. The surprising result was how often they recombined into new meanings. One of his motivations was to stimulate his creativity through accidental quasi-random inputs. I am content with seeing reading in a similar light; texts carefully written but carelessly read - like in abstract art, we are usually happy to let the text recombine in ways which say more about ourselves!
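(A cut-up in the Burroughs sense is easy to mechanize; here's a minimal sketch under the obvious assumptions: chop two texts into short fragments and splice them back together at random. The fragment length and sample sentences are arbitrary.)

```python
# Minimal sketch of a Burroughs-style cut-up: chop two texts into short word
# fragments and splice them back together at random. Inputs are arbitrary.
import random

def cut_up(text_a: str, text_b: str, fragment_len: int = 4, seed: int = 0) -> str:
    rng = random.Random(seed)                 # fixed seed so the splice is reproducible
    words = text_a.split() + text_b.split()
    fragments = [words[i:i + fragment_len] for i in range(0, len(words), fragment_len)]
    rng.shuffle(fragments)                    # the "accidental quasi-random input"
    return " ".join(word for fragment in fragments for word in fragment)

print(cut_up(
    "Language is a powerful tool but incredibly bad at conveying nuance and context.",
    "Our own mind has to fill in the blanks of a well written description.",
))
```

Whatever meaning the spliced output seems to carry is supplied by the reader, which is the point about careless reading above.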
I had a similar thought. Plus, if you're reading the text on the internet you might just assume something written weird or poorly is written by somebody whose first language isn't English.
By your description it clearly appears that whoever manages your company[1] is not actually interested in detecting cheating.
What you describe could easily be combated by giving teachers the ability to fail blatant cheaters.
[1] At this point it is hard to pretend that it is a university.
Like an ad-hoc GAN where Turnitin is the discriminator. Interesting.
Conduct tests in a room with all electronics confiscated
How good is AI-generated text in other languages?
> "The horse raced past the barn fell."
https://en.wikipedia.org/wiki/Garden-path_sentence