There are two ways to parse your first sentence. Are you saying that you used whisperX and it doesn't do well with diarization? Because I am curious of alternative ways of doing that.
I am also eyeing whisperX[2], because I want to play some more with speaker diarization.
Your use-case seems to be batch transcription, so I'd suggest you go ahead and just use whisperfile, it should work well on an M4 mini, and it also has an HTTP API if you just start it without arguments.
If you want more interactivity, I have been using Vibe[3] as an open-source replacement of SuperWhisper[4], but VoiceInk from a sibling comment seems better.
Aside: It seems that so many of the mentioned projects use whisper at the core, that it would be interesting to explicitly mark the projects that don't use whisper, so we can have a real fundamental comparison.
[1] https://huggingface.co/Mozilla/whisperfile
[2] https://github.com/m-bain/whisperX
Maybe it will deteriorate it, but it seems that the effect that different charge types have on batteries may not be complete yet.
$ lynx https://mastodon.social/
[…]
To use the Mastodon web application, please enable JavaScript.
Alternatively, try one of the native apps for Mastodon for your
platform.
The C2 Wiki does not load either: $ lynx https://wiki.c2.com/
[…]
javascript required to view this site
why
To their credit, at least they use the <noscript> tag to display the above notices. Some websites don't even bother with that. But there are many old school websites that still load fine to varying degrees: lynx https://danluu.com/ # Mostly okay but some needed spaces missing
lynx https://en.wikipedia.org/ # Okay, but a large wall of links on top
lynx https://irreal.org/blog/ # Renders fine
lynx https://libera.chat/ # Mostly fine
lynx https://news.ycombinator.com/ # Of course!
lynx https://sachachua.com/ # Mostly fine
lynx https://shkspr.mobi/ # Renders really well
lynx https://susam.net/ # Disclosure: This is mine
lynx https://norvig.com/ # A classic!
lynx https://nullprogram.com/ # Also pretty good
If you have more examples, please comment, and I'll add them to this list in the two hour edit window I have.While JavaScript has its place, I believe that websites that focus on delivering primarily text content could prioritise working well in TUI browsers. Sometimes testing it with text-based browsers may even show fundamental issues with your HTML. For example, several times, I've seen that multiple navigation links next to each other have no whitespace between them. The links may appear like this:
HomeBlogRSSAboutCodebergMastodon
Or, in a list of articles, dates and titles may appear jammed together: 14 Mar 2025The Lost Art of Dual Booting
15 Mar 2025Some Forgotten Features of Gopher
16 Mar 2025My Favourite DOS Games
The missing spaces aren't obvious in a graphical browser due to the CSS styling hiding the issue, but in a text-based one, the issue becomes apparent. The number of text-based web users may be shrinking, but there are some of us who still browse the web using tools like lynx, w3m, and M-x eww, at least occasionally. Just in case you haven’t tried it, the blog also works really well with terminal-based browsers, such as Lynx and ELinks. Go ahead and give it a shot. The header that normally appears at the top of the page is actually at the bottom of the HTML document structure. It’s out of the way for browsers that ignore CSS.
[1] https://nullprogram.com/blog/2017/09/01/We generate an ephemeral TLS root CA certificate and inject it into the system store. The generated certificate is entirely in-memory and never leaves the machine. To make this work without root privileges, we intercept the open(2) syscall to see if it's /etc/ssl/certs/ca-certificates.crt (or equivalent). If so, we append the ephemeral root CA to the list of actual CA certificates; if not, we let the kernel handle the file open like usual. This way, none of the other programs are affected, so only the program you start with `subtrace run` sees and trusts the ephemeral root CA.
After we get the program to trust the ephemeral root CA, we can proxy outgoing TLS connections through Subtrace transparently but also read the cleartext bytes.
All of this is fully automatic, of course.
This is especially useful because link-rot means that resources I was able to uncover might not be available in the future. A few years ago I did a massive amount of research into internals of old unixes for data recovery and maintenance of said systems in the modern era(machines attached to million dollar pieces of testing equipment go away when the machine does). I was maintaining and upgrading(mostly scsi2sd) and backing up systems that all predated y2k. Most of my research references are now dead links to nowhere. I now print to pdf as well as take archive.is links of all my referenced sources.
I'm generally terrible about blogging, but I'm changing that for 2025. I'm now in a position where I'm solutions architecting a lot of things as my primary day job. This makes easy blog post subjects that not only clarify my thoughts and understanding, but end up being the basis for the internal documentation on the subject.
A lot of what I now do is in terraform, cloudformation, golang, or Python. I make sure when I publish my blog post, I include a complete working example. For all my terraform, all one has to do is clone and run terraform apply, after satisfying the barebones prerequisites.
I found this[2] very enlightening.
[1] https://github.blog/open-source/git/get-up-to-speed-with-par...
[2] https://www.howtogeek.com/devops/how-to-use-git-shallow-clon...