Readit News logoReadit News
ghuntley commented on Provide agents with automated feedback   banay.me/dont-waste-your-... · Posted by u/ghuntley
jamesblonde · 20 days ago
I got turned off in the first paragraph with the misuse of the term "back pressure". "back pressure" is a term from data engineering to specifically indicate a feedback signal that indicates a service is overloaded and that clients should adapt their behavior.

Backpressure != feedback (the more general term). And in the agentic world, we use the term 'context' to describe information used to help LLMs make decisions, where the context data is not part of the LLM's training data. Then, we have verifiable tasks (what he is really talking about), where RL is used in post-training in a harness environment to use feedback signals to learn about type systems, programming language syntax/semantics, etc.

ghuntley · 20 days ago
the back pressure terminology comes from me. essentially it’s the wheel - you need to add backpressure to the agentic flywheel.

see https://ghuntley.com/pressure

i have the pleasure to work with moss and he came up with a way to explain what is in my head with ease.

u/ghuntley

KarmaCake day9837May 1, 2009
About
contact via https://ghuntley.com
View Original