Great article apart from that.
I don't want to be too pedantic, but it's not. Dutch syntax is very similar to German, both are typically analyzed as an SOV (subject-object-verb) language with movement of the finite verb to the V2 position in declarative main clauses, unlike English.
Word order of Dutch and German clauses (+ some other Germanic languages) are typically described in terms of topological fields [1] and Dutch and German have very similar (albeit not the same) topological field constraints, leading to very similar word orders. Like German, Dutch has grammatical genders (three like German, though only two are distinguished in definite articles), similar verb conjugation, etc.
People are often led to believe that Dutch is more similar to English because it doesn't have overt case marking.
[1] Simplified, a clause is partitioned into a vorfeld, mittelfeld, and nachfeld by the so-called brackets, which are the V2 and verb cluster positions.
Do you have recommended references about these grammatical constructs ?
We've rapidly engineered our way to some very impressive models this past decade, and yet gap in our real understanding of what's going on has widened. There's a large list of very basic questions about LLMs that we haven't answered (or in some cases, really asked). This is not a failing of people researching in this area, it's only that things move so quickly there's not enough time to ponder things like this.
At the same time, the result, unless I'm really misunderstanding, gives me the impression that anything other than grid search hyper parameter optimization is a fools errand. This would give credence to the notion that hyper parameter tuning really is akin to just re-rolling a character sheet until you get one that is over powered.
Most of the research is done with simpler models though (because mainly math people do it, and it's hard to prove anything on something as complex as a transformer).
I tried searching on YouTube but didn't find anything particularly unique.
This is a real project, this guy wrote a compiler, to convert common lisp to GLSL (I'm not talking about a simple DSL with code generation, but also type checking and so on).
In his Livestreams, he usually try to implement a specific 3D feature (like triplanar mapping, the Phong of lighting, ...).
Everything is done in a single window, with the program being recompiled while running, pure lisp style.
He is on a hiatus right now though with livestreams, but there is plenty of material already to watch ...
"No," said the zoomers, "we want to write XML, build scaffolding tools and eat shit."
"Your language will never be a LISP," was the God's reply.
>Heat flows from hot to cold because the number of ways in which the system can be non-uniform in temperature is much lower than the number of ways it can be uniform in temperature ...
Should probably say "thermal energy" instead of "temperature" if we want to be really precise with our thermodynamics terms. Temperature is not a direct measure of energy, rather it is an extensive property describing the relationship between change in energy to change in entropy.