The article states that they added a computational correction step which is run whenever a result might be incorrect but skip it most of the time.
I am wondering whether this might cause problems for small real-time systems that do not have an FPU. If the runtime cost of an operation depends on the input, it is very hard to figure out whether deadlines can be met in all operation conditions.
It feels like keeping it alive could really help long-term x64 support on Apple Silicon, even if Apple decides to move on.