They also spent more time in the shop with electrical gremlins than any car that I've ever seen. One of our employees insisted on a Mini, which is also BMW as their lease car. No other car we had had that much trouble. BMW is a crap brand that used to make very good cars. Mercedes is getting there. Neither of these will survive in the longer term if they don't somehow get back to their roots: making safe and reliable vehicles with good resale value. But for both companies the problems are in the same domain: they never got the hang of software.
German government will do everything possible to prevent these companies from failing, no matter how bad the situation gets. This means that current management can simply apply the "next quarter" strategy without any realistic downside.
Seriously, how?
But I actually had in mind the Windows app named "Xbox".
Microsoft cannot and will not ever get better at naming things. It is said the universe will split open and and eldritch beast will consume the stars the day Microsoft stops using inconsistent and overlapping names for different and conflicting products.
Isn't that right .Net/dotnet
CEO: Put AI everywhere/
Engineering Staff: There's a lot of places where it doesn't make sense to do this.
CEO: Do it or find somewhere else to work.
The problem of pushback at the lower levels is that it is completely ineffective when the top levels are set on something.
- having friends is more important than making output, which means that people above certain level just play politics instead of actually managing the company
- managers who miss targets get more people assigned which makes them climb the hierarchy, which means all levels below top level have the incentive to be inefficient
- saying "no" to the ruling party, no matter how stupid the idea is, is the second-easiest way to get replaced. The easiest is to offend the wrong person
- planning periods misaligned with the economic reality
An intelligent person will either be optimized out of the system, or will learn how to game it to their own advantage.
Small companies are more agile and innovative while corporations often just shuffle papers around. Wow, what a bold claim, never seen before in the entire history of economics.
When I was living in Berlin, the entire apartment complex had a WhatsApp group and people would (of course it's Berlin) party a lot. People would ask each other to turn down the volume, which worked for the most part - at least for severe partying. Best messages were like "you've been partying all night, it's 2pm, I need some silence to have a meeting.
Back then I was dreaming of some shared application, people could put on their phone or laptop and then the collective could decide or at least hint through that software that the volume was up too high.
So the model won’t “understand” that you have a skill and use it. The generation of the text that would trigger the skill usage is made via Reinforcement Learning with human generated examples and usage traces.
So why don’t the model use skills all the time? Because it’s a new thing, there is not enough training samples displaying that behavior.
They also cannot enforce that via RL because skills use human language, which is ambiguous and not formal. Force it to use skills always via RL policy and you’ll make the model dumber.
So, right now, we are generating usage traces that will be used to train the future models to get a better grasp of when to use skills not. Just give it time.
AGENTS.md, on the other hand, is context. Models have been trained to follow context since the dawn of the thing.
If you don't have time, just write the damn issue as you normally would. I don't quite understand why one would waste so much resources and compute to expand some lazily conceived half-sentence into 10 paragraphs, as if it scores them some points.
If you don't have time to write an issue yourself or carefully proofread whatever LLM makes up for you, whom are you trying to fool by making it look pretty? At least if it is visibly lazy anyone knows to treat it with appropriate grain of salt.
Even if you are one of those who likes to code by having to correct LLMs all the time, surely you understand if your LLM can make candy out of poo when you post an issue then it can do the exact same thing when it processes the issue and makes a PR. Likely next month it will do a better job at parsing your quick writing, and having it immediately "upscaled" would only hinder future performance.
Because it does. The goal here isn't to create good code, it's to create an impression of a person who writes good code. Even now, when software career is in freefall, for many people in poor countries it's still their only way out of poverty so they'll try everything possible to build a portfolio and get a job and the suffering of your little pet project isn't a part of the equation. Those people aren't trying to get Nobel prizes, they're trying to get any job that isn't farming with literal medieval-era technology.
My very radical personal opinion is that either we have small elitist circles of trust, or the internet will remain a global ghetto.