You will end up with much higher number of trials required to hit the P value than the version with predetermined number of trials and no stopping point by P.
Say, in a single variable single run ABX test, 8 is the usual number needed according to Fischer frequentist approach. If you do multiple comparison to hit 0.05 you need I believe 21 trials instead. (Don't quote me on that, compute your own Bayesian beta prior probability.)
The number of trials to differentiate from a fair coin is the typical comparison prior, giving a beta distribution. You're trying to set up a ratio between the two of them, one fitted to your data, the other null.
Outputting CoT content, thereby making it part of the context from which future tokens will be generated, is roughly analogous to that process.
Almost all the traditional criteria of intelligence - reasoning, planning, decisionmaking, memory etc - are exhibited pretty trivially by standard computer programs. Nevertheless no one would think of them as "intelligent" in the sense that humans or animals are.
On the other hand, we now have LLMs, that sent the entire tech world into a multi-year frenzy, precisely because they appear to possess that human-like intelligence.
And that is even though they perform worse than classical programs in some of the "intelligence" measures: For the first time, we have to worry that a computer program is "bad at math". They cannot reflect on past decisions and are physically unable to store long-term memories. And yet, we're much more likely to believe that an LLM is "intelligent" than a classical program.
This makes me think that our formal decisions of "intelligence" (the ones that would also qualify fungal networks, swarms, cells, societies, etc) and what we intuitively look out for, are really two different things.
Just two? You can name so many more terms in this concept cloud, e.g.: personhood, moral agency, consciousness, self-awareness, processing power, wit, autonomy, feeling-and-experiencing capacity, and so on… We don’t seem to agree on what’s separate from what, and yes, it would be useful.
I think it's obvious that some secrets should be kept. It makes little sense to expose our nuclear secrets, counter espionage, or ongoing investigation efforts. But how far does or should that extend? Should everything the NSA/CIA/FBI/IRS does be secret? Should they stay secret for years or decades or forever?
IMO, the US goes too far in it's secrets. Stuff gets classified that just makes the government look bad and that's dangerous.
And that's where I'm somewhat less concerned about putting US secrets into the cloud. Sure there's highly sensitive stuff that shouldn't go there, but there's also a lot of stuff that shouldn't have been a secret in the first place.
Why has the global economy put such a high benefit from investment bankers compared to, for example, family doctors?
"Nationalism" had more to do with the relationship between the state and the nation, not the existence of nations. The word "nation" is very old.
Nation comes from the Latin "natio" meaning "birth, origin; breed, stock, kind, species; race of people, tribe"[0]. Thus, the essential basis for nationality is familial, a matter of common descent (as all human beings form an extended family, where you draw the line on this blurry map will depend on other factors like culture and language and ultimately the good held in common; note how Croats, Serbs, and Bosnians speak basically the same language, it is the religious and therefore cultural differences that separate them). Naturally, people migrate all the time between nations. That is normal to the degree that migration does not harm the common good of the host society. But immigration is effectively a matter of adoption. We can adopt children. We can also adopt nations.
[0] https://www.etymonline.com/word/nation#etymonline_v_2309
2017: ~5,500 arrests
2019: ~7,734 arrests
2023: ~12,183 arrests
The British arrest stats subsume DV harassment cases, and the original Times reporting quoted a police officer stating that they are the bulk of these numbers. I haven’t found an apples-to-apples comparison in the US, but the FL number gives a point of reference.