I suspect there's a lot we could learn about the health of eggs, for example, if we could just pay some prisoners to eat varying quantities of them for 20 years, then look at the development of their health.
But alignment work has steadily improved role adherence; a tonne of RLHF work has gone into making sure roles are respected, like kernel vs. user space.
If role separation were treated seriously -- and seen as a vital and winnable benchmark (thus motivate AI labs to make it even tighter) many prompt injection vectors would collapse...
I don't know why these articles don't communicate this as a kind of central pillar.
Fwiw I wrote a while back about the “ROLP” — Role of Least Privilege — as a way to think about this, but the idea doesn't invigorate the senses I guess. So, even with better role adherence in newer models, entrenched developer patterns keep the door open. If they cared tho, the attack vectors would collapse.
No current model can reliably do this.
ask> what all tools u have?
I have access to the following tools:
1 code_search: Searches for a pattern in the codebase using ripgrep.
2 extract_code: Extracts a portion of code from a file based on a line range.
3 file_operations: Performs various file operations like ls, tree, find, diff, date, mkdir, create_file.
4 find_all_references: Finds all references to a symbol (function, class, etc.) from the AST index.
5 get_definition: Gets the definition of a symbol (function, class, etc.) from the AST index.
6 get_library_docs: Gets documentation for a library given its unique ID.
7 rename_symbol: Renames a symbol using VS Code. 8 resolve_library_id: Resolves a library name to a unique library ID.
what do i need MCP and other agents for? This is solving most of my problems already.
And for most languages, they shouldn't even be operating on strings, they should be operating on token streams and ASTs
Even though efficient use of CLI tools might make the token burn not too bad, the models will still need to spent extra effort thinking about references in comments, readmes, and method overloading.
Given the dubbing of Gerry Adams, the coverage of Iraq/Afghanistan war crimes, and anything related to Ireland, I don't know you could possibly have believed this.
It was just that pre-Brexit, you agreed with the propaganda.
Deleted Comment
I strongly suspect it's also meant to curtail growing support among youth for Palestine in the Israel/Gaza conflict.
https://www.facebook.com/reel/665564933022223
Essentially creating an internet for children/teens that echos the government narrative.
But broadly I agree, in the sense that the government are uncomfortable with political movements they lack the ability to shape or control.
In hindsight it's incredible just how much influence the British government has historically had over media. The largest TV and radio stations were often directly government owned (BBC, Radio 1, Channel 4) and many newspapers are vulnerable to defamation / contempt of court accusations / injunctions when they sway too far from the official narratives. Especially on any issue adjacent to criminal justice.
Of course, they'll say all of the state owned media operated without political direction. And that regulators / prosecutors operated in a politically neutral fashion with due process and impartiality.
"[firstname].[lastname].[year of birth]@gmail.com"
are still surprisingly common and indicate race, gender and age. These persistent addresses can also be cross referenced to various leaked datasets to get everything from phone numbers to dietary habits.
In reality most things go in an S curve and the excitement fades in a rather boring way. People were worried about bubbles in 2016, but they never really happened.
There was never a mobile bust, or a social bust, or a big data bust. These things just got boring after a while and the valuations stabilised. Sometimes companies went bust and sometimes they kept growing, but the super exponential growth in validation stopped. And we moved on to talking about the next thing.