we should be able to use this as a vscode extension to solve this issue. is there an sdk to integrate this into electron apps?
We are finalizing an electron app as we speak. That will allow recording anything that runs on your own system
Also, have you heard of Conformer-1 by Assembly-AI[1]? It released a few days ago and supposedly scored higher than Whisper on various benchmarks.
As for speed I have no idea how they make it so fast, but I'm sure they've written about it somewhere. My guess is at least that they are slicing the audio and parallelising it. Will look into Conformer-1 as well!
It is known from experience that this approach does not scale beyond toy examples and very simple websites. UX will begin to suffer from render lag somewhere around the UI complexity of an average webstore.