Readit News logoReadit News
TachyonicBytes commented on Ask HN: What API or software are people using for transcription?    · Posted by u/indigodaddy
oulipo · 6 months ago
Yes, but I guess it includes the "payment check" in the pre-built releases? Not sure but perhaps
TachyonicBytes · 6 months ago
It has the payment button at least
TachyonicBytes commented on Ask HN: What API or software are people using for transcription?    · Posted by u/indigodaddy
satvikpendem · 6 months ago
DiCoW-v2 seems to work better than whisperX for diarization, by the way.

https://pccnect.fit.vutbr.cz/gradio-demo/

TachyonicBytes · 6 months ago
It seems that both use / leverage pyannote. I wonder if the whisperX pipeline can be combined with DiCoW-v2.
TachyonicBytes commented on Ask HN: What API or software are people using for transcription?    · Posted by u/indigodaddy
levocardia · 6 months ago
I have used whisperX with success in a variety of languages, but not with diarization. If the goal is to use the transcript for something else, you can often feed the transcript into a text LLM and say "this is an audio transcript and might have some mistakes, please correct them." I played around with transcribing in original language vs. having whisper translate it, and it seems to work better transcribing in the original language, then feeding into an LLM and having that model do the translation. At least for french, spanish, italian, and norwegian. I imagine a text-based LLM could also clean up any diarization weirdness.
TachyonicBytes · 6 months ago
Yes, this is exactly where I am going. The LLM also has an advantage, because you can give it the context of the audio (e.g. "this is an audio transcript from a radio show about etc. etc."). I can foresee this working for a future whisper-like model as well.

There are two ways to parse your first sentence. Are you saying that you used whisperX and it doesn't do well with diarization? Because I am curious of alternative ways of doing that.

TachyonicBytes commented on Ask HN: What API or software are people using for transcription?    · Posted by u/indigodaddy
oulipo · 6 months ago
I'm using VoiceInk which is great and open-source (free if you build it yourself!)
TachyonicBytes · 6 months ago
It seems to be pre-built on github, in releases.
TachyonicBytes commented on Ask HN: What API or software are people using for transcription?    · Posted by u/indigodaddy
TachyonicBytes · 6 months ago
I use whisperfile[1] directly. The whisper-large-v3 model seems good with non-English transcription, which is my main use-case.

I am also eyeing whisperX[2], because I want to play some more with speaker diarization.

Your use-case seems to be batch transcription, so I'd suggest you go ahead and just use whisperfile, it should work well on an M4 mini, and it also has an HTTP API if you just start it without arguments.

If you want more interactivity, I have been using Vibe[3] as an open-source replacement of SuperWhisper[4], but VoiceInk from a sibling comment seems better.

Aside: It seems that so many of the mentioned projects use whisper at the core, that it would be interesting to explicitly mark the projects that don't use whisper, so we can have a real fundamental comparison.

[1] https://huggingface.co/Mozilla/whisperfile

[2] https://github.com/m-bain/whisperX

[3] https://github.com/thewh1teagle/vibe/

[4] https://superwhisper.com/

TachyonicBytes commented on BYD says new fast-charging system could be as quick as filling up a tank   theguardian.com/technolog... · Posted by u/gokhan
mansoorsheriff · 9 months ago
Won't too much fast charging deteriorate the battery health faster?
TachyonicBytes · 9 months ago
There was an article around here about how battery life actually improves if you ultra-fast charge the battery when you make it.

Maybe it will deteriorate it, but it seems that the effect that different charge types have on batteries may not be complete yet.

TachyonicBytes commented on Lynx is the oldest web browser still being maintained    · Posted by u/jahnu
susam · 9 months ago
It is unfortunate that modern web development has led to websites so complex that they either break entirely or look terrible in text-based browsers like Lynx. Take Mastodon, for example:

  $ lynx https://mastodon.social/
  […]
  To use the Mastodon web application, please enable JavaScript.
  Alternatively, try one of the native apps for Mastodon for your
  platform.
The C2 Wiki does not load either:

  $ lynx https://wiki.c2.com/
  […]
  javascript required to view this site
                   why
To their credit, at least they use the <noscript> tag to display the above notices. Some websites don't even bother with that. But there are many old school websites that still load fine to varying degrees:

  lynx https://danluu.com/            # Mostly okay but some needed spaces missing
  lynx https://en.wikipedia.org/      # Okay, but a large wall of links on top
  lynx https://irreal.org/blog/       # Renders fine
  lynx https://libera.chat/           # Mostly fine
  lynx https://news.ycombinator.com/  # Of course!
  lynx https://sachachua.com/         # Mostly fine
  lynx https://shkspr.mobi/           # Renders really well
  lynx https://susam.net/             # Disclosure: This is mine
  lynx https://norvig.com/            # A classic!
  lynx https://nullprogram.com/       # Also pretty good
If you have more examples, please comment, and I'll add them to this list in the two hour edit window I have.

While JavaScript has its place, I believe that websites that focus on delivering primarily text content could prioritise working well in TUI browsers. Sometimes testing it with text-based browsers may even show fundamental issues with your HTML. For example, several times, I've seen that multiple navigation links next to each other have no whitespace between them. The links may appear like this:

  HomeBlogRSSAboutCodebergMastodon
Or, in a list of articles, dates and titles may appear jammed together:

  14 Mar 2025The Lost Art of Dual Booting
  15 Mar 2025Some Forgotten Features of Gopher 
  16 Mar 2025My Favourite DOS Games
The missing spaces aren't obvious in a graphical browser due to the CSS styling hiding the issue, but in a text-based one, the issue becomes apparent. The number of text-based web users may be shrinking, but there are some of us who still browse the web using tools like lynx, w3m, and M-x eww, at least occasionally.

TachyonicBytes · 9 months ago
I have to add https://nullprogram.com, just because of the care the author took to have it work better in lynx[1]:

    Just in case you haven’t tried it, the blog also works really well with terminal-based browsers, such as Lynx and ELinks. Go ahead and give it a shot. The header that normally appears at the top of the page is actually at the bottom of the HTML document structure. It’s out of the way for browsers that ignore CSS.
[1] https://nullprogram.com/blog/2017/09/01/

TachyonicBytes commented on Show HN: Subtrace – Wireshark for Docker Containers   github.com/subtrace/subtr... · Posted by u/adtac
adtac · 10 months ago
Yes, but we've managed to do it automatically without any library/language specific hooks! It's probably one of my favourite things in Subtrace :)

We generate an ephemeral TLS root CA certificate and inject it into the system store. The generated certificate is entirely in-memory and never leaves the machine. To make this work without root privileges, we intercept the open(2) syscall to see if it's /etc/ssl/certs/ca-certificates.crt (or equivalent). If so, we append the ephemeral root CA to the list of actual CA certificates; if not, we let the kernel handle the file open like usual. This way, none of the other programs are affected, so only the program you start with `subtrace run` sees and trusts the ephemeral root CA.

After we get the program to trust the ephemeral root CA, we can proxy outgoing TLS connections through Subtrace transparently but also read the cleartext bytes.

All of this is fully automatic, of course.

TachyonicBytes · 10 months ago
Is this a different method from the httptap [1] that was on hackernews a few weeks ago? Somebody in that post seemed to say that it also generates CA certificates on the fly.

[1] https://news.ycombinator.com/item?id=42919909

TachyonicBytes commented on Why blog if nobody reads it?   andysblog.uk/why-blog-if-... · Posted by u/alexgiann
dwood_dev · 10 months ago
I generally blog about things that took a week+ of research and I want to save some other poor bastard the pain of what I had to piece together.

This is especially useful because link-rot means that resources I was able to uncover might not be available in the future. A few years ago I did a massive amount of research into internals of old unixes for data recovery and maintenance of said systems in the modern era(machines attached to million dollar pieces of testing equipment go away when the machine does). I was maintaining and upgrading(mostly scsi2sd) and backing up systems that all predated y2k. Most of my research references are now dead links to nowhere. I now print to pdf as well as take archive.is links of all my referenced sources.

I'm generally terrible about blogging, but I'm changing that for 2025. I'm now in a position where I'm solutions architecting a lot of things as my primary day job. This makes easy blog post subjects that not only clarify my thoughts and understanding, but end up being the basis for the internal documentation on the subject.

A lot of what I now do is in terraform, cloudformation, golang, or Python. I make sure when I publish my blog post, I include a complete working example. For all my terraform, all one has to do is clone and run terraform apply, after satisfying the barebones prerequisites.

TachyonicBytes · 10 months ago
I use Zotero[1] as a personal web archiver. It downloads the page locally, placing most of the resources inside a single html file (pictures become base64 encoded pngs, for example). I find it the best way to have the content available offline and also to be able to reference it easily, seeing as it is a citation manager first.

[1] https://www.zotero.org/

TachyonicBytes commented on Git clone –depth 2 is vastly better than –depth 1 if you want to Git push later   stackoverflow.com/questio... · Posted by u/jakub_g
jbreckmckye · 10 months ago
Why can't git push, when it encounters a `.git/shallow`, just ask the git server to fill in the remaining history by verifying the parent hashes the client can send?
TachyonicBytes · 10 months ago
It can, but that's another type of "shallow", or more exactly "not-deep" cloning, called blobless cloning [1]. There is also treeless cloning, with other tradeoffs, but much to the same effect.

I found this[2] very enlightening.

[1] https://github.blog/open-source/git/get-up-to-speed-with-par...

[2] https://www.howtogeek.com/devops/how-to-use-git-shallow-clon...

u/TachyonicBytes

KarmaCake day125December 3, 2022
About
Building a WebAssembly consultancy at @tachyonicbytes. Hire me to port your project to WASM or to start a new WASM project.

Interested in wasmer, wasmtime, wasm3 as runtimes and AssemblyScript, Rust, Go, C, C++ as languages.

Write to me at support ampersand myname dot com

View Original