Readit News
agilebyte commented on The unreasonable effectiveness of an LLM agent loop with tool use   sketch.dev/blog/agent-loo... · Posted by u/crawshaw
nico · 10 months ago
Cool tool. What format does it expect from the model?

I’ve been looking for something that can take “bare diffs” (unified diffs without line numbers) from the clipboard and then apply them directly on a buffer (an open file in vscode).

None of the paste diff extensions for vscode work, as they expect a full unified diff/patch.

I also tried a Google-developed patch tool, but it also wasn’t very good at taking in the bare diffs, and def couldn’t do clipboard.

agilebyte · 10 months ago
Markdown format with a comment saying what the file path is. So:

This is src/components/Foo.tsx

```tsx
// code goes here
```

OR

```tsx
// src/components/Foo.tsx
// code goes here
```

These seem to work the best.
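
As a rough illustration of how a response in either of the two formats above could be turned into (path, code) pairs, here is a minimal sketch. Everything in it is an assumption made for illustration: the `parseFileBlocks` name, the `FileBlock` shape, and the path-matching regexes are hypothetical and not taken from any actual tool.

```typescript
// Hypothetical sketch: names and regexes are illustrative assumptions.
interface FileBlock {
  path: string;
  code: string;
}

// Built at runtime so this example does not contain a literal fence marker.
const FENCE = "`".repeat(3);

// Assumed heuristic: a path is a token of word chars, dots, slashes or dashes
// that ends in a file extension.
const PATH_RE = /[\w./-]+\.\w+/;

function parseFileBlocks(markdown: string): FileBlock[] {
  const blocks: FileBlock[] = [];
  let lastProse = ""; // most recent non-empty line outside a fence
  let inFence = false;
  let path: string | null = null;
  let body: string[] = [];

  for (const line of markdown.split("\n")) {
    const isFence = line.trimStart().startsWith(FENCE);
    if (!inFence && isFence) {
      // Format 1: the prose line before the fence names the file.
      inFence = true;
      body = [];
      const m = lastProse.match(PATH_RE);
      path = m ? m[0] : null;
    } else if (inFence && isFence) {
      // Closing fence: keep the block only if a path was found.
      inFence = false;
      if (path !== null) blocks.push({ path, code: body.join("\n") });
      path = null;
      lastProse = "";
    } else if (inFence) {
      // Format 2: the first line inside the fence is a comment with the path.
      if (body.length === 0 && path === null) {
        const m = line.match(/^\s*\/\/\s*(\S+\.\w+)\s*$/);
        if (m) {
          path = m[1];
          continue; // the path comment itself is not part of the code
        }
      }
      body.push(line);
    } else if (line.trim() !== "") {
      lastProse = line;
    }
  }
  return blocks;
}
```

Blocks without a recognizable path are dropped rather than guessed at, which is one reason a fallback step would still be needed for messier responses.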

I tried diff syntax, but Gemini 2.5 just produced way too many bugs.

I also tried using regex and creating an AST of the markdown doc and going from there, but ultimately settled on calling gpt-4.1-mini-2025-04-14 with the beginning of the code block (```) and 3 lines before and 3 lines after the beginning of the code block. It's fast/cheap enough to work.
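
The context window described here (the opening fence plus 3 lines before and 3 lines after it) could be collected roughly like this. This is only a sketch under assumptions: `fenceContexts` and `FenceContext` are hypothetical names, and the model call itself is left out because its prompt and request shape are not given above.

```typescript
// Hypothetical sketch: collects the snippet that would be sent to the model.
interface FenceContext {
  line: number; // 0-based index of the opening fence
  snippet: string; // opening fence plus `radius` lines on each side
}

// Built at runtime so this example does not contain a literal fence marker.
const FENCE_MARK = "`".repeat(3);

function fenceContexts(markdown: string, radius = 3): FenceContext[] {
  const lines = markdown.split("\n");
  const result: FenceContext[] = [];
  let inFence = false;

  lines.forEach((text, i) => {
    if (!text.trimStart().startsWith(FENCE_MARK)) return;
    if (!inFence) {
      // Opening fence: clamp the window to the bounds of the document.
      const start = Math.max(0, i - radius);
      const end = Math.min(lines.length, i + radius + 1);
      result.push({ line: i, snippet: lines.slice(start, end).join("\n") });
    }
    // Every fence line toggles between "inside" and "outside" a block.
    inFence = !inFence;
  });
  return result;
}
```

Each snippet is small (at most 7 lines with the default radius), which fits the "fast/cheap enough" point: the model only sees the few lines that might reveal the file path.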

Though I still have to make edits sometimes. WIP.

agilebyte commented on The unreasonable effectiveness of an LLM agent loop with tool use   sketch.dev/blog/agent-loo... · Posted by u/crawshaw
_bin_ · 10 months ago
I've found sonnet-3.7 to be incredibly inconsistent. It can do very well but has a strong tendency to get off-track and run off and do weird things.

3.5 is better for this, ime. I hooked claude desktop up to an MCP server to fake claude-code less the extortionate pricing and it works decently. I've been trying to apply it for rust work; it's not great yet (still doesn't really seem to "understand" rust's concepts) but can do some stuff if you make it `cargo check` after each change and stop it if it doesn't.

I expect something like o3-high is the best out there (aider leaderboards support this) either alone or in combination with 4.1, but tbh that's out of my price range. And frankly, I can't mentally get past paying a very high price for an LLM response that may or may not be useful; it leaves me incredibly resentful as a customer that your model can fail the task, requiring multiple "re-rolls", and you're passing that marginal cost to me.

agilebyte · 10 months ago
I am avoiding the cost of API access by using the chat/ui instead, in my case Google Gemini 2.5 Pro with the high token window. Repomix a whole repo. Paste it in with a standard prompt saying "return full source" (it tends to not follow this instruction after a few back and forths) and then apply the result back on top of the repo (vibe coded https://github.com/radekstepan/apply-llm-changes to help me with that). Else yeah, $5 spent on Cline with Claude 3.7 and instead of fixing my tests, I end up with if/else statements in the source code to make the tests pass.
agilebyte commented on Proposed coal plants in the United States   gem.wiki/Category:Propose... · Posted by u/maxwell
14 · 5 years ago
How, with this year's heat domes and ever-increasing evidence that global climate change will affect the western world in a negative way, are we still even considering burning coal? Please tell me these projects will be scrapped.
agilebyte · 5 years ago
Coal is used in (some) peaking power plants that only turn on during a surge of demand for electricity. Some also use natural gas. These plants are easier to spin up/down and can deal with the strain of being used intermittently.
agilebyte commented on Entire German bond yield curve back in sub-zero territory   reuters.com/article/euroz... · Posted by u/nabla9
Phelinofist · 5 years ago
Why should I do this?
agilebyte · 5 years ago
Individuals: Your mattress is not big enough for all your cash and your neighbors are eyeing it too...

Institutions: requirements that they hold a % of their portfolio in safe assets like bonds.

agilebyte commented on DoorDash from Application to IPO   blog.ycombinator.com/door... · Posted by u/todsacerdoti
raverbashing · 5 years ago
So how's their business model different from Deliveroo/Just Eat?
agilebyte · 5 years ago
Raise money, buy competition, raise fees?
agilebyte commented on An update to storage policies across your Google Account   blog.google/products/phot... · Posted by u/e2e4
chadlavi · 5 years ago
so if I have registered MyChildsFullName @ gmail to prevent anyone else from taking it, I now need to make sure it gets at least one email every couple years to prevent them from deleting it before my child is old enough to use it?
agilebyte · 5 years ago
Who are you trying to prevent from taking the handle? Seems like it's either a) adults - that have a legit use for it now, b) soon to be adults that will have a legit use for it sooner than your kid does.
agilebyte commented on A Little Money Laundering Can Have a Big Impact on Real Estate Prices (2019)   betterdwelling.com/how-a-... · Posted by u/throw0101a
apta · 5 years ago
How is the Canadian government allowing this to happen? They should ban Chinese from buying and driving up prices, it's quite obvious what's going on.
agilebyte · 5 years ago
It drives up the price of real estate, all of it. Which means that for those Canadians that bought - it's a huge segment of their net worth. And they vote.
agilebyte commented on India bans TikTok, WeChat, and dozens of other Chinese apps   techcrunch.com/2020/06/29... · Posted by u/samdung
chrisco255 · 6 years ago
We in the U.S. should have partnered more closely with India in the first place. India is democratic and holds values closer to our own. The CCP not so much.
agilebyte · 6 years ago
More democratic than China, yes. A low bar to clear.
agilebyte commented on Santa Cruz, California bans predictive policing in U.S. first   reuters.com/article/us-us... · Posted by u/rbanffy
rudolph9 · 6 years ago
How so? What is your thought process on this?
agilebyte · 6 years ago
Given a choice between two neighbourhoods, would I want to move to one that has grow ops or a high rate of social services checkups? Some of that data is already easily available and I know it is being used by real estate agents already. The people that can avoid these neighbourhoods will, which leaves only those that can't. Gentrification.

The neighbourhood I grew up in was heavily mixed (along social strata) which prevented these problems from arising in the first place.

agilebyte commented on Santa Cruz, California bans predictive policing in U.S. first   reuters.com/article/us-us... · Posted by u/rbanffy
rudolph9 · 6 years ago
I wish anonymous metadata and source code were open! This kind of software has potential to positively impact society but not when it cannot be freely audited and debated by the public empowered to vote for changes to it.

I suspect there is a storm brewing with proprietary government software. Social services like child endangerment checkups, child placements, etc., sentencing recommendations, predictive policing as mentioned, watchdog/oversight departments, and so much more!

These tools can empower us or enslave us, and a big step toward empowerment is for the open source community to push for open source and open data for software-augmented government functions!

agilebyte · 6 years ago
If we had access to granular data like this, wouldn't it lead to further gentrification and a further rift within a society?

u/agilebyte

Karma: 833
Cake day: April 6, 2011
About
Victoria, BC, Canada https://github.com/radekstepan http://radekstepan.com