Readit News logoReadit News
corysama · 2 years ago
Unfortunately, just this week someone fine-tuned the Mistral-7B LLM to play DOOM :P

https://news.ycombinator.com/item?id=39813174

paulryanrogers · 2 years ago
For very modest definitions of playing. Perhaps it'd be more impressive if they recorded a demo file and let that play back without the realtime overhead? Even so it can only move in forward, back, turn, and fire. And only knows to face away from the wall it's collided with. This is so far below even basic Doom bots that I'd be afraid to call it playing.

The ASCII intermediate interpretation also seems unnecessary and very limiting. But perhaps that's to keep it near realtime, looks like 1 FPS?

And why run on a Mac? Why not a beefy PC with a GPU that can do the calculations faster?

Still, does seem like a fun challenge. Maybe with further tuning or training it can level up

lobocinza · 2 years ago
Reminded me of "Growing Living Rat Neurons To Play... DOOM?"

https://www.youtube.com/watch?v=bEXefdbQDjw

wahnfrieden · 2 years ago
any models fine tuned for playing an open src game that is non-GPL so that it can be deployed to the app store for interesting bot play ideas?
brcmthrowaway · 2 years ago
How could this possibly be in the training set?
corysama · 2 years ago
It’s not. The fine tuning taught the LLM how to give single-character responses (move/fire keyboard controls) in response to a sequence of ASCII-art-ized frames of the game being played.
sugarkjube · 2 years ago
Absolutely love it. Unusual captcha's are great.

Reminded me of this one: http://random.irb.hr/signup.php

esaym · 2 years ago
Funny. I made a captcha challenge of calculus problems for a comment section on my personal blog page. But 5 years after college, I couldn't remember how to even do them myself so I changed it :-/
iopq · 2 years ago
wolfram alpha can do it for you
evgpbfhnr · 2 years ago
You don't actually need much, for a form I used to get spam in I just added a "write 42 here" so anyone who actually cares to read would be able to fill it. spam fell to 0.

(for a site with a slightly higher profile this wouldn't be enough, but for a minor corner of the internet with no ill intent actually aimed at it that turned out to be enough to block the fuzzing "fill all the forms" spam)

electroly · 2 years ago
As contrasting experience, I did that (a simple math problem) on our contact form and it did NOT drop spam to zero; our spammers were too smart for that. Even an actual reCAPTCHA didn't completely eliminate it (although it mostly did, enough that it's fine for us).
kqr · 2 years ago
Similarly an empty input field that is css'd to be outside the viewport is often filled by spambots but not humans. But I like the edge case UX of your idea more.
koito17 · 2 years ago
The question I got was surprisingly simple: it asked to find "the least real root of the polynomial p(x) = (x+5)(x-4)(x+1)". A determined attacker can quickly hack together something with Tesseract and feed it into even GPT-3.5 to get the correct answer to questions like these.

I guess that means the captcha is doing its job, since running LLMs isn't very cheap or scalable. But any harder problem means you start filtering a significant chunk of human users. Based on the other replies to your comment, it seems that the questions at their current difficulty already stop a lot of human users, yet allow a determined attacker with the setup I described pass through easily.

explaininjs · 2 years ago
I'm not sure how you'd determine the least real root to that, given all three have equally zero imaginary component.
baud147258 · 2 years ago
I remember an old (and now defunct) fan site who hit you with lore questions as a captcha. Though I'd guess a LLM could answer
onlyrealcuzzo · 2 years ago
Can I play by an audio call if I'm visually impared?
Keyframe · 2 years ago
Yes, when you hear a monster roar you say BANG!
Kwpolska · 2 years ago
The first one I got was 7 * 7 + (-3). That’s trivial, elementary-school-level math, and did they really need LaTeX to render that?

Then I refreshed the page, and was hit with calculus involving trig functions.

marvinborner · 2 years ago
Or the one on esolangs.org where you need to evaluate some random Befunge code.
em-bee · 2 years ago
after reloading a dozen times i finally got one that i could solve:

-3 * 3 + (-3) = ?

jakderrida · 2 years ago
I just got one I think I can solve: 0 + 7 + 0 = ?

Where's my calculator?

nottorp · 2 years ago
I got "find the last real zero of the polynomial..." but what does last mean? Largest? Last as the polynomial's factors are given? Something else?

Edit: oh wait. It's "least". I really have no idea then :)

jelder · 2 years ago
It let me through despite trying to attack a cacodemon with a pistol.

With it being so famously portable, I was expecting this to actually run Doom in the browser and complete a simple map.

sira04 · 2 years ago
I'm still waiting for someone to make the Mona Lisa Captcha: https://www.youtube.com/watch?v=WqnXp6Saa8Y
wanderer2323 · 2 years ago
Absolute banger. But the auto-aim on vertical axis is missing. You should be able to have the crosshair under an enemy and still hit them. But in any case, nicely done!
daveslash · 2 years ago
Funny enough, when I've tried to introduce (indoctrinate) friends to DOOM, "how do I aim up" has consistently been the biggest hangup.

This makes sense when I try to indoctrinate my teenager who grew up on Halo and Call of Duty. But I began noticing this hangup in the late 90s with friends my own age.

evrimoztamur · 2 years ago
Here's the real Doom player!
modeless · 2 years ago
Why isn't it actually Doom? Surely there are multiple JS Dooms to choose from.
tiltowait · 2 years ago
"Finish UV Hangar in < 13 seconds."

Easily achievable[0], thoroughly obnoxious[1]. Just like all captchas.

[0] God help you if you're on a touchscreen. [1] For most people. Especially after the novelty wears off.

kadoban · 2 years ago
Doom is still under copyright protection last I knew. The source is GPL, but have the assets ever been liberally licensed? I think they're more abandonware.

I'm sure you could still do it, but personally I try to respect copyright strictly for any projects I'm going to share. It just feels annoying to have copyright nonsense hanging over me otherwise.

modeless · 2 years ago
Well certainly we don't need the full game assets for a captcha. The shareware version would do just fine and that's always been free.
Solvency · 2 years ago
Yeah kind of bummed me out.
jml7c5 · 2 years ago
You should try for a full 3D implementation of Doom! I'm sure it's been ported to JavaScript at least a dozen times.
taneq · 2 years ago
Wny stop there when you could just use a webassembly port of the actual game with hacked-in portal to the actual site somewhere... :P
nottorp · 2 years ago
For bonus points fire up a Windows VM that will run the original Doom files...

Or maybe a remote desktop into an OS with a sandboxed browser that runs a Windows VM that ...

Dowwie · 2 years ago
I want a doom progress window that allows a user to play doom while waiting for a task to complete