Readit News logoReadit News
BenderV commented on Claude Advanced Tool Use   anthropic.com/engineering... · Posted by u/lebovic
jondwillis · 3 months ago
well the name “function” is already taken - they deprecated it so that we could call functions, tools.
BenderV · 3 months ago
Well, I think they should have kept calling it function... ^^'
BenderV commented on Claude Advanced Tool Use   anthropic.com/engineering... · Posted by u/lebovic
BenderV · 3 months ago
It feels crazy to me that we are building "tool search" instead of building real tool with interface, state and available actions. Think how would you define a Calculator, a Browser, a Car...?

I think, notably, one of the errors has been to name functions calls "tools"...

BenderV commented on How to build a coding agent   ghuntley.com/agent/... · Posted by u/ghuntley
diminish · 6 months ago
Mini swe agent, as an academic tool, can be easily tested aimed to show the power of a simple idea against any LLM. You can go and test it with different LLMs. Tool calls didn't work fine with smaller LLM sizes usually. I don't see many viable alternatives less than 7GB, beyond Qwen3 4B for tool calling.

> right tools allow small models to perform better than undirected tool like bash to do everything.

Interesting enough the newer mini swe agent was refutation of this hypothesis for very large LLMs from the original swe agent paper (https://arxiv.org/pdf/2405.15793) assuming that specialized tools work better.

BenderV · 6 months ago
Thanks for your answer.

I guess that it's only a matter of finetuning.

LLM have lots of experience with bash so I get they figure out how to work with it. They don't have experience with custom tools you provide it.

And also, LLM "tools" as we know it need better design (to show states, dynamic actions).

Given both, AI with the right tools will outperform AI with generic and uncontrolled tool.

BenderV commented on How to build a coding agent   ghuntley.com/agent/... · Posted by u/ghuntley
diminish · 6 months ago
> sad to see lack of tools.

Lack of tools in mini-swe-agent is a feature. You can run it with any LLM no matter how big or small.

BenderV · 6 months ago
I'm trying to understand what does it got to do with LLM size? Imho, right tools allow small models to perform better than undirected tool like bash to do everything. But I understand that this code is to show people how function calling is just a template for LLM.
BenderV commented on How to build a coding agent   ghuntley.com/agent/... · Posted by u/ghuntley
ofirpress · 6 months ago
We (the Princeton SWE-bench team) built an agent in ~100 lines of code that does pretty well on SWE-bench, you might enjoy it too: https://github.com/SWE-agent/mini-swe-agent
BenderV · 6 months ago
Nice but sad to see lack of tools. Most your code is about the agent framework instead of specific to SWE.

I've built a SWE agent too (for fun), check it out => https://github.com/myriade-ai/autocode

BenderV commented on How to build a coding agent   ghuntley.com/agent/... · Posted by u/ghuntley
normie3000 · 6 months ago
Why are any of the tools beyond the bash tool required?

Surely listing files, searching a repo, editing a file can all be achieved with bash?

Or is this what's demonstrated by https://news.ycombinator.com/item?id=45001234?

BenderV · 6 months ago
Why do humans need a IDE when we could do anything in a shell? Interface give you the informations you need at a given moment and the actions you can take.
BenderV commented on When is it the best time to post on Show HN?   myriade.ai/blogs/when-is-... · Posted by u/BenderV
TruffleLabs · 7 months ago
What is your desired definition of "best"?

Most readers on initial posting?

Quality comments?

Quantity traffic on shown website?

BenderV · 7 months ago
Here, the best time is defined as the highest chance of getting "some" visibility. Most posts quickly fade away.

You are right that there is lots of way to measure this but quality comments is way harder to judge and we don't have quantity traffic info.

u/BenderV

KarmaCake day211August 17, 2013
About
benderv.com

Founder of myriade.ai

Contact me at benderville / at // google mailing platform

View Original