Today the setup experience on a brand-new iPhone or Mac is abysmal. Entering the same username and password multiple times - then sometimes a different username and password - competing notifications, irrelevant feature nags, a popup from some random product manager about their pet thingy. Permission questions from some meddlesome privacy team about the feature you just said you wanted to turn on. Uncertainty about whether you’ll break something irreparably by “skipping” the expected setup path. A choice of several inscrutable interface modes because no one has the balls to commit to a single solution. Just terrible.
I guess this is what happens without a dictator to tell people they’re fired for shipping garbage, and when a company worries about meeting quarterly KPIs rather than doing something great.
I don't know what kind of pro-authoritarian sane-washing statement you're trying to make with this line. Jobs himself would tell you that it's a consequence of letting a salesperson run the company rather than a product person.
It means you get 12,000! (Factorial) concepts in the limit case, more than enough room to fit a taxonomy
whisperx input.mp3 --language en --diarize --output_format vtt --model large-v2
Works a treat for Zoom interviews. Diarization is sometimes a bit off, but generally its correct.Thanks but I'm looking for live diarization.
People should check out Subtitle Edit (and throw the dev some money) which is a great interface for experimenting with Whisper transcription. It's basically Aegisub 2.0, if you're old, like me.
HOWTO:
Drop a video or audio file to the right window, then go to Video > Audio to text (Whisper). I get the best results with Faster-Whisper-XXL. Use large-v2 if you can (v3 has some regressions), and you've got an easy transcription and translation workflow. The results aren't perfect, but Subtitle Edit is for cleaning up imperfect transcripts with features like Tools > Fix common errors.
EDIT: Oh, and if you're on the current gen of Nvidia card, you might have to add "--compute_type float32" to make the transcription run correctly. I think the error is about an empty file, output or something like that.
EDIT2: And if you get another error, possibly about whisper.exe, iirc I had to reinstall the Torch libs from a specific index like something along these lines (depending on whether you use pip or uv):
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
uv pip install --system torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
If you get the errors and the above fixes work, please type your error message in a reply with what worked to help those who come after. Or at least the web crawlers for those searching for help.There is no time factor in any absolute entropy equation.
Empirically, if you measure the entropy of a closed system at a given time, and you measure the entropy of that same closed system at a different time, then calculate the deltas of each, their signs match so long as the time delta is finite and the system isn't empty. So stated plainly, as time increases, so does entropy.
By combining these first principle formulae with the empirical results on entropy, you arrive at the second law of thermodynamics. However, like I said before, we're not really sure why the signs match and it's considered to be an unsolved problem in physics.
https://en.wikipedia.org/wiki/List_of_unsolved_problems_in_p...