Readit News logoReadit News
diminish commented on How to build a coding agent   ghuntley.com/agent/... · Posted by u/ghuntley
BenderV · 3 days ago
I'm trying to understand what does it got to do with LLM size? Imho, right tools allow small models to perform better than undirected tool like bash to do everything. But I understand that this code is to show people how function calling is just a template for LLM.
diminish · 3 days ago
Mini swe agent, as an academic tool, can be easily tested aimed to show the power of a simple idea against any LLM. You can go and test it with different LLMs. Tool calls didn't work fine with smaller LLM sizes usually. I don't see many viable alternatives less than 7GB, beyond Qwen3 4B for tool calling.

> right tools allow small models to perform better than undirected tool like bash to do everything.

Interesting enough the newer mini swe agent was refutation of this hypothesis for very large LLMs from the original swe agent paper (https://arxiv.org/pdf/2405.15793) assuming that specialized tools work better.

diminish commented on It is worth it to buy the fast CPU   blog.howardjohn.info/post... · Posted by u/ingve
diminish · 3 days ago
Multi-core operations like compiling C/C++ could benefit.

Single thread performance of 16-core AMD Ryzen 9 9950X is only 1.8x of my poor and old laptop's 4-core i5 performance. https://www.cpubenchmark.net/compare/6211vs3830vs3947/AMD-Ry...

I'm waiting for >1024 core ARM desktops, with >1TB of unified gpu memory to be able to run some large LLMs with

Ping me when some builds this :)

diminish commented on How to build a coding agent   ghuntley.com/agent/... · Posted by u/ghuntley
BenderV · 3 days ago
Nice but sad to see lack of tools. Most your code is about the agent framework instead of specific to SWE.

I've built a SWE agent too (for fun), check it out => https://github.com/myriade-ai/autocode

diminish · 3 days ago
> sad to see lack of tools.

Lack of tools in mini-swe-agent is a feature. You can run it with any LLM no matter how big or small.

diminish commented on Trust in Firefox and Mozilla Is Gone – Let's Talk Alternatives   boilingsteam.com/poll-tru... · Posted by u/ekianjo
diminish · 6 months ago
Firefox's problem is Mozilla; too much ad money made them evil. copying every chrome feature to boost search ads by breaking usability :(

We need a new browser fork! Linux way of governance could be awesome; a star leader/programmer- with community and enterprise contributors.

/just switched to librewolf for the short term; after years on firefox since early 0.x days

diminish commented on Dear friend, you have built a Kubernetes   macchaffee.com/blog/2024/... · Posted by u/todsacerdoti
diminish · 9 months ago
One can build a better container orchestration than kubernetes; things don't need to be that complex.
diminish commented on Electric vehicle battery prices are expected to fall almost 50% by 2026   goldmansachs.com/insights... · Posted by u/doener
diminish · 10 months ago
We need >2000 km ranges in one charge of electric vehicles for widespread adoption.
diminish commented on Refact Code LLM: 1.6B LLM for code that reaches 32% HumanEval   refact.ai/blog/2023/intro... · Posted by u/kateklink
kateklink · 2 years ago
We’ve finished training a new code model Refact LLM which took us about a month. The main use-case is for blazing-fast code completion with fill-in-the-middle, additionally, the model could reply to chat prompts.

It has much better performance than all of the code models of similar size, and almost reaches the same HumanEval as Starcoder being 10x smaller in size.

With the small size, it can work with most modern GPUs requiring just 3GB Ram.

You can try self-hosting it in Refact https://github.com/smallcloudai/refact/ and get a local fast copilot alternative with decent suggestions.

Weights and model card https://huggingface.co/smallcloudai/Refact-1_6B-fim.

We would love to hear your feedback!

diminish · 2 years ago
Does ctransformer (https://github.com/marella/ctransformers#supported-models) support running refact?

I see that model type "gpt_refact" in https://huggingface.co/smallcloudai/Refact-1_6B-fim/blob/mai...

diminish commented on AI won’t replace humans, but humans with AI will replace humans without AI   hbr.org/2023/08/ai-wont-r... · Posted by u/sahin
diego_moita · 2 years ago
I'll use analogies with previous technologies.

Agriculture shaped humans to create buildings, create social classes with different levels of power, create armies, governments and rulers to protect themselves from other tribes. Gradually these grew up became empires and created monotheist religions to substitute animist religions.

The Industrial Revolution shaped humans and culture to move to cities, work in factories/offices instead of home/fields and to think in more abstract terms (math, written words, etc).

Cars shaped human society, governments and urban plans to be heavily dependent on roads and all the car assistance economy. Think Houston/Los Angeles vs. Copenhagen/Amsterdam.

Radio, television and publicity shaped our daily lives to consume a lot of crap we don't really need, from cigarettes to diamond rings.

Social media shaped our political discourse into tribal stupidity and paranoia.

So just use basic Marshall McLuhan: the media is the message. Technology shapes humans and culture. AI will shape humans and culture. We just don't know how.

diminish · 2 years ago
Controversially, agricultural societies late to industrialization, have more population now and are replacing early industrial societies due to lower birth rates in industrial workers.

So I doubt humans with AI will replace humans without.

diminish commented on StableLM: A new open-source language model   stability.ai/blog/stabili... · Posted by u/davidbarker
IceWreck · 2 years ago
StabilityAI is the real OpenAI. Thanks for this.
diminish · 2 years ago
Well said. Openai is a promise unkept. Thanks StabilityAI for existing.

u/diminish

KarmaCake day4706November 21, 2010View Original