Readit News logoReadit News
reg_dunlop commented on Wired headphone sales are exploding   bbc.com/future/article/20... · Posted by u/billybuckwheat
gruez · 14 hours ago
>now an interaction with a wire sucks.

>But that's not the wire's fault.

So... "it's not the fall that kills you, it's the sudden deceleration"?

reg_dunlop · 3 minutes ago
I believe that's a straw man fallacy.

My assertion is that an inanimate object becomes a problem when a human interacts with it.

Your attempted logical argument is that a law of nature is the same as an inanimate object.

I am not contesting that an inanimate object is the same as gravity.

I'm saying that humans make bad decisions with simple things and whine about inanimate objects and that's a very real first world problem.

reg_dunlop commented on Wired headphone sales are exploding   bbc.com/future/article/20... · Posted by u/billybuckwheat
xnyan · 14 hours ago
Counterpoint: wires really suck and are not fine. AirPods Pro are great, I can afford them and they improve my quality of like quite a bit.

Different strokes I guess.

reg_dunlop · 14 hours ago
counter counterpoint:

A wire sitting on a table does not suck. 2 people can gather around that table and still, the wire does not suck. As soon as 1 person picks up the wire and starts doing something with it....now an interaction with a wire sucks.

But that's not the wire's fault.

reg_dunlop commented on Shall I implement it? No   gist.github.com/bretonium... · Posted by u/breton
boxedemp · a day ago
It has a lot. I find by challenging it often, getting it to explain it's assumptions, it's usually guessing.

This can be overcome by continuously asking it to justify everything, but even then...

reg_dunlop · a day ago
Trust shouldn't be inherent in our adoption of these models.

However, constant skepticism is an interesting habit to develop.

I agree, continually asking it to justify may seem tiresome, especially if there's a deadline. Though with less pressure, "slow is smooth...".

Just this evening, a model gave an example of 2 different things with a supposed syntax difference, with no discernible syntax difference to my eyes.

While prompting for a 'sanity check', the model relented: "oops, my bad; i copied the same line twice". smh

reg_dunlop commented on A standard protocol to handle and discard low-effort, AI-Generated pull requests   406.fail/... · Posted by u/Muhammad523
youknownothing · 8 days ago
Quis custodiet ipsos custodes?

If the problem is that we don't trust people who use AI without understanding its output, and we base the gate-keeping on tests that are written on AI, then how can we trust that output?

reg_dunlop · 8 days ago
Isn't that the purpose of red/green refactoring though? To establish working software that expresses regression, and builds trust (in the software)?

If your premise is that people would shift to using AI to write tests they don't understand, then that's not necessarily a failing of the contributor.

The contributor might not understand the output, but the maintainer would be able to critique a spec file and determine pretty quickly if implementation would be worthwhile.

This would necessitate a need for small tickets, thereby creating small spec files, and easier review by maintainers.

Also, any PR that included a non spec file could be dismissed patently.

It is possible for users of AI to learn from reading specs.

But if agents are doing the entire thing (reading the ticket, generating the PR, submitting the PR)...then the point of people not understanding is moot.

reg_dunlop commented on Payphone Go   walzr.com/payphone-go/... · Posted by u/walz
summermusic · 8 days ago
Real world exploration games like this and Jet Lagged: The Game Hide and Seek are just so cool.

I’d play it if payphones from my state were included! I don’t know if they are licensed/registered here though.

reg_dunlop · 8 days ago
Ya know, I just spun up a version of a user-driven exploration game, as an homage to the sf0.org from back in the aughts. https://irl2-production.up.railway.app/

Google auth still not hooked up, but otherwise good enough for now. And it's open source.

reg_dunlop commented on A standard protocol to handle and discard low-effort, AI-Generated pull requests   406.fail/... · Posted by u/Muhammad523
reg_dunlop · 8 days ago
I'd love to hear some commentary about my idea surrounding this problem of AI PRs.

Why not restrict the agents to writing tests only?

If the tickets are written concisely, any feature request or fix could be reduced to necessary spec files.

This way, any maintainer would be tasked with reviewing the spec files and writing the implementation.

CI is pretty good at gatekeeping based on test suites passing...

reg_dunlop commented on If AI writes code, should the session be part of the commit?   github.com/mandel-macaque... · Posted by u/mandel_x
jiveturkey · 12 days ago
reg_dunlop · 12 days ago
Thank you! Was looking for this company. Founder was high up at GitHub. Really an interesting proposition
reg_dunlop commented on How I use Claude Code: Separation of planning and execution   boristane.com/blog/how-i-... · Posted by u/vinhnx
jamesmcq · 20 days ago
The key part of my comment is "correctly".

Does it write maintainable code? Does it write extensible code? Does it write secure code? Does it write performant code?

My experience has been it failing most of these. The code might "work", but it's not good for anything more than trivial, well defined functions (that probably appeared in it's training data written by humans). LLMs have a fundamental lack of understanding of what they're doing, and it's obvious when you look at the finer points of the outcomes.

That said, I'm sure you could write detailed enough specs and provide enough examples to resolve these issues, but that's the point of my original comment - if you're just writing specs instead of code you're not gaining anything.

reg_dunlop · 20 days ago
To answer all of your questions:

yes, if I steer it properly.

It's very good at spotting design patterns, and implementing them. It doesn't always know where or how to implement them, but that's my job.

The specs and syntactic sugar are just nice quality of life benefits.

reg_dunlop commented on How I use Claude Code: Separation of planning and execution   boristane.com/blog/how-i-... · Posted by u/vinhnx
fourthark · 20 days ago
This is exactly what the article is about. The tradeoff is that you have to throughly review the plans and iterate on them, which is tiring. But the LLM will write good code faster than you, if you tell it what good code is.
reg_dunlop · 20 days ago
Exactly; the original commenter seems determined to write-off AI as "just not as good as me".

The original article is, to me, seemingly not that novel. Not because it's a trite example, but because I've begun to experience massive gains from following the same basic premise as the article. And I can't believe there's others who aren't using like this.

I iterate the plan until it's seemingly deterministic, then I strip the plan of implementation, and re-write it following a TDD approach. Then I read all specs, and generate all the code to red->green the tests.

If this commenter is too good for that, then it's that attitude that'll keep him stuck. I already feel like my projects backlog is achievable, this year.

reg_dunlop commented on Outcome Engineering   o16g.com/... · Posted by u/purplerabbit
burnerToBetOut · a month ago
Those are all swell ideas. Thanks for posting the page!

Now what's meant to happen next?

In practice I mean.

reg_dunlop · 21 days ago
I think it's a matter of implementing different aspects of outcomes into a project/codebase.

It's one thing to add skills and system prompts to a project; it's another to accurately frame and orchestrate the various parts of a context window to maximize the inputs in an efficient way to optimize the outputs of these agents/models we've all begun using.

Practically: explore and influence definitions of context to refine your ability to contribute to the documents that steer the agents.

Or better yet, focus on a part of the context window and figure out how to expose resources responsibly.

Personally, I think local models are the future...but that's just a hunch

u/reg_dunlop

KarmaCake day185April 13, 2022View Original