In my experience, the best strategy is to minimize your use of GHA itself: call out to binaries or shell scripts, so that as little of your setup as possible depends on the GHA world. That makes it easier to test locally, too.
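Concretely, the workflow file ends up being a thin shim. Here's a minimal sketch of what I mean; the trigger, job name, and `ci/test.sh` path are all placeholders, not a prescription:

```yaml
# Hypothetical thin workflow: all the real logic lives in ci/test.sh,
# which you can run locally with no GHA machinery at all.
name: ci
on: [push]

jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: ./ci/test.sh
```

The YAML never grows beyond checkout-and-run, so debugging happens in the script on your laptop, not in the Actions console.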
I think tons of interpersonal engineering issues boil down to a failure to apply this principle.
I agree completely that an LLM's first attempt to write a Semgrep rule is as likely as not to be horseshit. That's true of everything an LLM generates. But I'm talking about closed-loop LLM code generation. Unlike legal arguments and medical diagnoses, you can hook an LLM up to an execution environment and let it see what happens when the code it generates runs. It then iterates until it has something that works (there's a sketch of that loop below).
Which, when you think about it, is how a lot of human-generated code gets written too.
So my thesis here does not depend on LLMs getting things right the first time, or without assistance.
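To make the closed loop concrete, here's a minimal sketch under stated assumptions: `ask_llm` is a stand-in for whatever client call you actually use, and the prompts, timeout, and iteration cap are arbitrary choices, not anyone's real harness:

```python
import subprocess
import sys
import tempfile
from typing import Callable, Optional

def closed_loop(
    ask_llm: Callable[[str], str],  # stand-in for your actual LLM client
    task: str,
    max_iters: int = 5,
) -> Optional[str]:
    """Generate code, run it, feed the failure back, repeat until it works."""
    prompt = f"Write a Python script that does the following:\n{task}"
    for _ in range(max_iters):
        code = ask_llm(prompt)
        with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
            f.write(code)
            path = f.name
        # Actually run the generated code and see what happens.
        result = subprocess.run(
            [sys.executable, path], capture_output=True, text=True, timeout=60
        )
        if result.returncode == 0:
            return code  # "works" by this crude test, anyway
        # The model sees its own failure and iterates.
        prompt = (
            f"This script failed:\n\n{code}\n\n"
            f"stderr was:\n{result.stderr}\n\nFix it and return the full script."
        )
    return None  # gave up
```

A real harness would sandbox the execution and check actual assertions rather than just an exit code, but the shape of the loop is the point.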
I only came down hard on that quote out of context because it felt somewhat standalone, and I want to broadcast this “fluency paradox” point a bit louder: I keep running into people who really need to hear it.
I know you know what’s up.
This is the thing with LLMs. When you’re not an expert, the output always looks incredible.
It’s similar to the fluency paradox: if you’re not a native speaker of a language, anyone you hear speaking it at a higher level than yourself sounds fluent to you, even if, say, they’re actually just a beginner.
The problem with LLMs is that they’re very good at appearing to speak “a language” at a higher level than you, even if they totally aren’t.
Fly.io seriously needs to get it together. Why it hasn’t happened yet is a mystery to me. They have a good product, but stability needs to be an absolute top priority for a hosting service. Everything else is secondary.