And herein lies the issue with ChatGPT: it can generate functioning code, but it can also lie through its nonexistent teeth about it. Using ChatGPT (or Copilot) can feel like pair-programming with a very talented developer who loves to bullshit.
In this case I think I'd give ChatGPT the benefit of the doubt. It is possible to invent something that already exists, and it has happened on several occasions throughout history. A great example is the question of who really invented the telephone first. In the end Alexander Graham Bell got the patent, but perhaps Elisha Gray was actually first? Historians remain divided on the topic.
For instance, I once found what I thought was an ingeniously original idea about how TV is really just a kind of reflection of reality akin to Plato's Cave. I immediately got started writing a thesis about it, but I didn't have to search the topic for long before I found an entire book written on this way of thinking about television. I wasn't really disappointed, because in the back of my head I knew it was too good to be true that I'd be first with such a great idea. In any case I kept working on the thesis, and I still got a good grade on it despite the idea not being revolutionary.
The question I now wonder about is: can ChatGPT forget? Or could it be that ChatGPT was never exposed to this game, but could still infer it from other game rules, such as those for Sudoku? Which I guess opens up another rabbit hole on whether and how AI can be creative. Which I guess opens up another rabbit hole on how creativity works in general.
> can also lie through its nonexistent teeth about it
Ironically, it seems to me that you are anthropomorphizing ChatGPT a bit too much here. It has no reason to lie, so I think it's more likely that it just doesn't know such a game exists. It probably came up with it independently or doesn't have a strong memory of it. In some respects, it would be even more impressive if it were actually "lying through its teeth", because that would imply the AI had some kind of hidden agenda.
I am confused as to how this would be "the issue" with ChatGPT. Being wrong and not being aware of it is not a unique concept. At least with ChatGPT it is fair to assume there is no hidden agenda and no need to worry about ill will. If anything that makes it less of an issue, compared to humans.
In my experience, using Copilot for generating code is usually a lot less weird because it has more context; instead of using made-up function names and APIs, it can see what's been defined in other files.
But I primarily find Copilot helpful for instances when I need a bunch of almost identical code with tiny changes (which could mean I'm coding wrong).
"Very talented developer"? Sorry, I don't think googling my prompt and replying with the top stackoverflow answer (or a mashup of the top answers) counts as a talented developer.
Anecdotal, but I've not yet had any success in producing any non-trivial code with ChatGPT. It has, however, produced copious amounts of bullshit with plausible variable names... :)
It is a dilettante; it has not reached the level of "talented" in anything. It knows many things about many things and nothing in depth. Test it on your specialisation and you will see it make absurd mistakes and hallucinate. Try it on a domain you know less about and it looks perfect.
A while ago another poster thought ChatGPT invented good jokes.[1] All of them were ripoffs, which took less effort to verify than it takes to make a new post.
I get people are excited about a chatbot which doesn’t suck, but ideally it wouldn’t turn off critical thinking skills.
Seems to be similar to a game called Kakuro. This [1] repo even contains a similar rule:
> The algorithm exceed the rules that the sum over a row must equal to the value on the left and the sum over a column must be equal to the value on the bottom of the cells with the diagonal and one or two numbers
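That quoted constraint is simple to state in code. A minimal sketch in C (the function and array names are mine, not the repo's):

    #include <stdbool.h>

    /* Do the kept values in one row add up to the clue on the left?
       Columns are checked the same way against the clue at the bottom. */
    bool row_ok(const int row[], const bool kept[], int n, int clue) {
        int sum = 0;
        for (int i = 0; i < n; i++)
            if (kept[i]) sum += row[i];
        return sum == clue;
    }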
But where would GPT have sourced information about how the game works? That page only has screenshots; I suppose maybe there's a subreddit or something for it as well. Even if there's a bunch of info on it, it's still incredibly impressive for it to parse those game rules and turn them into workable code.
It would be nice if GPT could dump the sources of how it came to such a solution: whether it generated the game by random chance, combining various unrelated chunks of text and mixing up the rules, or whether it used some text describing the game you linked.
This is an interesting note at the end: it's clear that the whole "conversation" hasn't been posted, and it's not clear how many prompts were needed to finish it. A proficient developer would be able to develop this same app to the same level (once they have the idea) in a couple of hours without ChatGPT too.
It would be super interesting to see a full screen recording of the process (or a similar one). What was the total word count of all the prompts, and how does that compare to the size of the code?
Don't get me wrong, this is incredible and inspiring (I'm regularly using ChatGPT and Copilot). But I think the post doesn't critically analyse the process well enough.
Yes, I think it's kinda rude (in a way) to cut out 99% of the ChatGPT conversation, because it gives people the impression that ChatGPT is even more magical than it actually is
In my experience, most time is spent in the generate-copy-paste-compile-error cycle. Especially for strict, strongly-typed languages, checking for programming errors can be done by a machine, so this process can be automated (not for logical errors, of course).
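A minimal sketch of what automating that loop could look like, with a hypothetical ask_model_to_fix() standing in for a real LLM API call:

    #include <stdio.h>
    #include <stdlib.h>

    /* Hypothetical placeholder: a real harness would send the error log
       back to the LLM and overwrite the source file with its fix. */
    static void ask_model_to_fix(const char *errlog, const char *src) {
        printf("re-prompting model with %s to repair %s\n", errlog, src);
    }

    int main(void) {
        for (int attempt = 0; attempt < 5; attempt++) {
            /* 2>errors.txt captures the compiler's diagnostics */
            if (system("cc -Wall -Werror -o game generated.c 2>errors.txt") == 0) {
                puts("compiled cleanly");
                return 0;
            }
            ask_model_to_fix("errors.txt", "generated.c");
        }
        fprintf(stderr, "still broken after 5 attempts\n");
        return 1;
    }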
I don't understand why people don't think this is noteworthy.
Would it be noteworthy if this game (whether or not it existed before) were designed and implemented in a few hours by someone with very limited software development experience? Well, the fact that ChatGPT did this means the former is now a very real possibility.
That you can go from conceptual inception of a framework to a fairly complete product in a few hours with very little experience is a big deal.
> That it also coded it up itself is pretty amazing
I haven't tried to replicate the author's journey, but in my experience it requires a considerable amount of hand-holding. e.g. Functions will be stubbed out, but contain no logic.
I was able to guide it through a "playable" Flappy Bird, but it took several revisions where I pointed out what was wrong or needed to be done before it truly returned a functional, error-free prototype.
It felt like pair-programming with a promising and apologetic junior dev.
Because the game has existed for centuries. ChatGPT claiming ownership is nothing new, either. The current top post even links an android app that does the same.
I would have much preferred if the author spent more time giving an objective assessment of what ChatGPT had actually accomplished at each step.
The "Labyrinth Sudoku" description feels like a classic language model speciousness. It doesn't actually work: you can't fit the digits 1-9 if you can't use all the cells, and it hasn't modified sudoku rules in a way that makes paths relevant. Maybe you can come up with a way to salvage it, but ChatGPT didn't.
The initial rules for Sum Delete should be read with this in mind: it sounds reasonable, but there's no reason to trust that it can make a puzzle at all, let alone a good one. Also, unsurprisingly, the provided puzzle isn't solvable (the 25 column requires use of the 9 in the first row).
Similarly, I'd love a critical analysis of the initial code. Did it guarantee solvability? An awful lot can be swept under "improving the design".
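Checking solvability is mechanical, which makes the omission notable. A hedged brute-force sketch of what such a check could look like for a small grid (the names and layout are my own, not the article's code):

    #include <stdbool.h>

    #define N 3  /* 2^(N*N) masks is tractable up to roughly 5x5 */

    /* Try every subset of cells to keep; the puzzle is solvable iff some
       subset makes every row and every column hit its target sum. */
    bool solvable(const int g[N][N], const int row_t[N], const int col_t[N]) {
        for (unsigned mask = 0; mask < (1u << (N * N)); mask++) {
            bool ok = true;
            for (int r = 0; r < N && ok; r++) {
                int sum = 0;
                for (int c = 0; c < N; c++)
                    if (mask & (1u << (r * N + c)))  /* cell kept */
                        sum += g[r][c];
                ok = (sum == row_t[r]);
            }
            for (int c = 0; c < N && ok; c++) {
                int sum = 0;
                for (int r = 0; r < N; r++)
                    if (mask & (1u << (r * N + c)))
                        sum += g[r][c];
                ok = (sum == col_t[c]);
            }
            if (ok) return true;
        }
        return false;
    }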
You guys are funny. An AI generates a game, comes up with the rules, writes the code and designs the web page for it. Your reactions:
- Bah, it's not very fun.
- It's been done before.
- It took too long to make.
Seriously. Let me repeat that. An AI generates a game. It comes up with the rules for the game. It even writes the code and designs the web page for it!
Come on! This is amazing!
The "it's been done before" one is pretty relevant. It means the model didn't actually generate the game, but likely pulled it more or less straight out of its training data. It's still very cool that you can ask it for something and it can basically mine the entire (2021) internet for it, but it's not the same as being able to create something really new.
I've noticed the same thing testing it on various coding questions. It's extremely good at problems that have solutions online. And given stackoverflow, that's a lot of problems. If you manage to hit it with something that it hasn't seen before though, even if it's conceptually very straightforward, it tends to just generate a mix of boilerplate and nonsense.
Exactly. When the first news came out about its ability to "understand" code, find bugs, and improve upon it, I tested it with some snippets of mine. It just gave the boilerplate best practices you find on hundreds of blogs, but it was not able to make a meaningful contribution. It claimed to have introduced a feature when it had only found another way to write the same snippet. On other things it straight-up invented variables and functions that didn't exist.
As long as the task is in its training set, it can give you a decent answer, but it can't code; it just mimics doing so...
>The "it's been done before" one is pretty relevant.
But is it? 99.999% of software development has been done before. Even if you do something that is legitimately new (like creating a chatbot that can generate code on demand), your solution will still consist of more than 99% code that is just a repeat of things that have already been done.
That's not my experience at all. Copilot consistently creates implementations that are very specific to my app and manages to understand the context and problem surface spanning many files. It's not just taking a standard problem and pulling an answer from Stack Overflow.
Even if the rules were inspired by some text that's on the internet rather than a genuine invention (we'll never actually know; we're all just speculating), it hasn't just "pulled it out of its training data".
To be asked in plain, simple (ish) English to invent a game, produce code for it and then style it etc and the few other bits the author asked for _is_ impressive.
Why are we asking for so much? Remember the chatbots of the mid-2000s? Eliza etc? They were impressive for the time but GPT represents a _huge_ improvement in this stuff. Of course it's not perfect, but it's an exhilarating jump in capabilities.
But it hasn't! This is just another step in the BS storm coming out of the latest AI hype. The language model has reproduced something that has existed before and was likely part of its training data. That's cool, but it's far from what's being claimed here.
We really need to get better at fact checking this stuff. And with "this stuff" I mean the output of LLMs and other AI frameworks as well as the claims about it. And with "we" I mean society as a whole and our industry in particular. Let's keep the hype in the drawer. The general population can be hyped up about something, but we should know better, so instead of joining the hype, let's keep a cool head and educate people about what this is and what it isn't.
The second point in your list of reactions, "It's been done before", is very crucial.
That defeats the point of your argument that “AI generates a game. It comes up with rules for the game”.
No, it doesn't. It plagiarised the game and pretended to come up with it. It just used a random puzzle game that it had in its training set.
It’s like asking it to write a poem and getting the same exact poem from a random google search. It didn’t come up with it. It just copied it. It’s not as amazing as you say it is.
Also, if you look in the comments you can see that it's not even just one game; there are several games that are exactly like that, which means a higher probability of it having been in the training set.
> It’s like asking it to write a poem and getting the same exact poem from a random google search.
No, it's not. A better comparison would be a poem that feels the same as an existing one, with the same style but its own words. Or any musical plagiarism dispute where the song is clearly different but similar enough that it needs to be decided by a court. ChatGPT is not just copy-pasting a puzzle game here.
I worry that a lot of otherwise brilliant developers are going to get blindsided by this stuff.
The current models are impressive in strong, quantifiable ways. They are only going to become more powerful from this point.
Consider the current state of affairs: ChatGPT supports a 4K context size. Leaked foundry pricing indicates models that can handle 32K context size. 32K tokens is enough for your entire brand manual or several days worth of call center transcripts. Many products could have the most important parts of their codebase completely loaded into just the prompt.
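For rough scale, assuming the common heuristic of about 0.75 English words per token (it varies a lot by content): 32,000 tokens × 0.75 ≈ 24,000 words, on the order of 50 dense pages, versus roughly 3,000 words for a 4K window.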
I would say you should at least try the OpenAI playground (or equivalent technology) to understand what is possible right now. I had no clue where we were at until ~3 weeks ago. I wouldn't wait until 2024 on this one anymore.
Agreed. LLMs are on par with the invention of the web or the smartphone in terms of how much impact they'll have (possibly more). It's weird to see so many HNers being so dismissive of them. I've been using ChatGPT daily (mostly to ask programming-related questions) and it's like having a new superpower.
I know it's getting popular, but I really doubt 90% of software development is generated from trained models...
I don't understand what's at stake here that makes you feel people are afraid. It's fun and amazing that it can spit out stuff like this, and if you are a good developer experimenting with this stuff, you already know it's inarguably a novel and useful utility, if still limited in some ways.
But where is the fire? Why does everything have to devolve into one vague culture war or another? Shouldn't you welcome good-faith critique? If only for the fact that these things can still be improved, and how can you hope to improve them if you smother and dismiss every suggestion that these models might be less than perfect?
It is fairly typical of HN to err on the side of cynicism.
Correct me if I'm wrong, but ChatGPT is a very fancy auto-complete function. It has no ability to create from scratch, just the ability to recombine and recontextualise any of the many existing pieces it has in its library.
It's unlikely that this game or its rules are truly original; ChatGPT will have just plucked it from the library, perhaps given it a new name.
I don't see how your comment contributes to the discussion. It seems aimed at shutting it down only allowing praise.
The scope of the creation and whether it actually produced something novel is quite important to the discussion and part of the claim (although the author is very open to be proven wrong, in the article).
Your claim in the second-to-last paragraph is false. That's relevant. This is HN.
What we're effectively seeing is that when someone demonstrates a talking horse, some people will complain that the horse speaks with a horrible accent and uses impolite language. (Loose quote from I don't remember who, early 90s.)
How do you reconcile "An AI generates a game" with "It's been done before"? Obviously it didn't generate a game; it copied one, which is not "amazing".
Very cool! Although I would love to be proven wrong, I am still suspicious that this is actually a code sample from an existing game that it picked up somewhere during training. Seeing how ChatGPT struggles with basic logic, I would be surprised if it could actually generate a new and workable game of logic.
The way I understand these models (and please correct me if I'm wrong) is that they predict every single word one at a time out of billions and billions of possible paths. So for the model to actually reproduce code you would either need to bait it really hard (reducing the possible paths by pushing it into a corner) or be impossibly "lucky". It can't just copy-paste something accidentally, since it has no "awareness" of the source material it was trained on.
It is trivial to encode text in neural networks; these models try hard to avoid that by punishing it in training, but they still encode a lot of text word for word. The most famous example is "fast inverse square root": it gives you the exact same thing, same comments and all.
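For reference, this is the snippet in question, the widely circulated Quake III C function, which models have been shown to reproduce comment-for-comment:

    float Q_rsqrt( float number )
    {
        long i;
        float x2, y;
        const float threehalfs = 1.5F;

        x2 = number * 0.5F;
        y  = number;
        i  = * ( long * ) &y;                       // evil floating point bit level hacking
        i  = 0x5f3759df - ( i >> 1 );               // what the fuck?
        y  = * ( float * ) &i;
        y  = y * ( threehalfs - ( x2 * y * y ) );   // 1st iteration
    //  y  = y * ( threehalfs - ( x2 * y * y ) );   // 2nd iteration, this can be removed

        return y;
    }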
That game is NOT the same game. It's similar but the games are different.
[1]: https://news.ycombinator.com/item?id=34744921
[1]: https://github.com/MarioBonse/KakuroSolverCSP
[2]: https://github.com/topics/kakuro
Your game involves addition. ChatGPT is using subtraction.
I mean... the whole point is that you don't need to be that proficient to use ChatGPT.
If ChatGPT were much better than a proficient developer, you and I wouldn't be surfing HN; we'd be on the street protesting for universal basic income.
It was quite amazing how much actual manual work was going into some (not all) of those images in the end.
A lot of processing in Photoshop, etc. I actually started to think that, really, he was doing art, just not using a brush.
That it also coded it up itself is pretty amazing, but it's overshadowed by the false, more amazing claim that it invented it.
i) This exact puzzle predates the app
ii) A description of the rules appears in the training set
Lichess and chess.com both exist. Does the existence of one make the other worthless? Unimpressive?
The "Labyrinth Sudoku" description feels like a classic language model speciousness. It doesn't actually work: you can't fit the digits 1-9 if you can't use all the cells, and it hasn't modified sudoku rules in a way that makes paths relevant. Maybe you can come up with a way to salvage it, but ChatGPT didn't.
The initial rules for Sum Delete should be read with this in mind: it sounds reasonable, but there's no reason to trust that it can make a puzzle at all, let alone a good one. Also, unsurprisingly, the provided puzzle isn't solvable (the 25 column requires use of the 9 in the first row).
Similarly, I'd love a critical analysis of the initial code. Did it guarantee solvability? An awful lot can be swept under "improving the design".
- Bah, it's not very fun.
- It's been done before.
- It took too long to make.
Seriously. Let me repeat that. An AI generates a game. It comes up with the rules for the game. It even writes the code and designs the web page for it!
Come on! This is amazing!
Nevermind that this perfectly describes 90% of software development.
I'm actually wondering to what extent these responses are fueled by fear of being replaced by AI.
"Great artists steal".
Art is defined by remixing the life experience of the author.
ChatGPT's ability to create art is only limited by its input (a text corpus), while humans have images, sound, smell, touch, etc.
For such an obscure game I would guess it came up with the rules. It’s difficult to prove though.
On a flip note: the game is fun enough.
Even if it was, what would we search for?
I was hoping someone would find a reference on Wikipedia or something, which would clearly be part of the training set.
https://imgur.com/a/0k9aXGZ