kibbi (u/kibbi) - Readit News

kibbi commented on OpenAI Audio Models openai.fm/... · Posted by u/KuzeyAbi

wingworks · 5 months ago

Did you try Kokoro? You can self host that. https://huggingface.co/spaces/hexgrad/Kokoro-TTS

kibbi · 5 months ago

Thanks! But I get the impression that with Kokoro, a strong CPU still requires about two seconds to generate one sentence, which is too much of a delay for a TTS voice in an AAC app.

I'd rather accept a little compromise regarding the voice and intonation quality, as long as the TTS system doesn't frequently garble words. The AAC app is used on tablet PCs running from battery, so the lower the CPU usage and energy draw, the better.

kibbi commented on OpenAI Audio Models openai.fm/... · Posted by u/KuzeyAbi

dharmab · 5 months ago

I use Piper for one of my apps. It runs on CPU and doesn't require a GPU. It will run well on a raspberry pi. I found a couple of permissively licensed voices that could handle technical terms without garbling them.

However, it is unmaintained and the Apple Silicon build is broken.

My app also uses whisper.cpp. It runs in real time on Apple Sillicon or on modern fast CPUs like AMD's gaming CPUs.

kibbi · 5 months ago

I had already suspected that I hadn't found all the possibilities regarding Tortoise TTS, Coqui, Piper, etc. It is sometimes difficult to determine how good a TTS framework really is.

Do you possibly have links to the voices you found?

kibbi commented on OpenAI Audio Models openai.fm/... · Posted by u/KuzeyAbi

5kg · 5 months ago

May I introduce to you

https://huggingface.co/canopylabs/orpheus-3b-0.1-ft

(no affiliation)

it's English only afaics.

kibbi · 5 months ago

The sample sounds impressive, but based on their claim -- 'Streaming inference is faster than playback even on an A100 40GB for the 3 billion parameter model' -- I don't think this could run on a standard laptop.

kibbi commented on OpenAI Audio Models openai.fm/... · Posted by u/KuzeyAbi

ZeroTalent · 5 months ago

Look into https://superwhisper.com and their local models. Pretty decent.

kibbi · 5 months ago

Thank you, but they say "Offline models only run really well on Apple Silicon macs."

kibbi commented on OpenAI Audio Models openai.fm/... · Posted by u/KuzeyAbi

kibbi · 5 months ago

Large text-to-speech and speech-to-text models have been greatly improving recently.

But I wish there were an offline, on-device, multilingual text-to-speech solution with good voices for a standard PC — one that doesn't require a GPU, tons of RAM, or max out the CPU.

In my research, I didn't find anything that fits the bill. People often mention Tortoise TTS, but I think it garbles words too often. The only plug-in solution for desktop apps I know of is the commercial and rather pricey Acapela SDK.

I hope someone can shrink those new neural network–based models to run efficiently on a typical computer. Ideally, it should run at under 50% CPU load on an average Windows laptop that’s several years old, and start speaking almost immediately (less than 400ms delay).

The same goes for speech-to-text. Whisper.cpp is fine, but last time I looked, it wasn't able to transcribe audio at real-time speed on a standard laptop.

I'd pay for something like this as long as it's less expensive than Acapela.

(My use case is an AAC app.)

kibbi commented on Apple's Software Quality Crisis eliseomartelli.it/blog/20... · Posted by u/ajdude

rafram · 6 months ago

Title should be something like “Apple Pencil Pro causes iPad to overheat and slow down”. This sounds really annoying, but the overly broad title is just clickbait.

kibbi · 6 months ago

In my experience, this bug - lags and overheating when drawing with the Apple Pencil - exists since iPadOS 16. When searching for it on the web, I found lots of reports and no indication that it is solved, including by hardware replacements.

In any case, HN's guidelines ask to use the original title of an article, unless it is misleading or linkbait. I'd agree that Apple's software quality has been going down.

kibbi commented on macOS Tips and Tricks (2022) saurabhs.org/macos-tips... · Posted by u/pavel_lishin

countrymile · 6 months ago

How do I right click and add a blank text file to a folder?

kibbi · 6 months ago

I've used Apple's Automator app to add a new custom Quick Action which does exactly this. After right-clicking a folder, the right-click menu shows my custom Quick Action to create an empty text file.

This requires about 5 to 10 minutes to set up. You'll find instructions for this on the web or via some LLM. I've looked right now for a suitable article, but the ones I've found are subtly different from my Quick Action. I've asked ChatGPT and its instructions seem to be correct.

kibbi commented on When ChatGPT summarises, it does nothing of the kind ea.rna.nl/2024/05/27/when... · Posted by u/josephcsible

the_other · a year ago

You write as if you’ve found a hole in the article’s argument. The lack of evidence is a hole in the reporting, for sure. The tone of your comment suggests you feel that by not publishing all their evidence, the author’s point is wrong (rather than under-justified). However, the example you use to back up your point also backs up the article’s point. The article’s point is that ChatGPT doesn’t summarise, it only shortens. Your example indicates shortening, but not summarising.

kibbi · a year ago

In the example I mentioned, ChatGPT 4 did keep all essential statements of my texts when reproducing shorter versions of them. For example, it often wrote one high-level sentence which skillfully summarized a paragraph of the original text. As far as I understand, this is what the author meant by 'summarizing' vs. 'shortening (while missing essential statements)'.

I was impressed at those high-level summaries. If I had assigned this task to several humans, I'm not sure how many would have been able to achieve similar results.

kibbi commented on When ChatGPT summarises, it does nothing of the kind ea.rna.nl/2024/05/27/when... · Posted by u/josephcsible

kibbi · a year ago

Am I blind or is there no mention at all of the GPT model he used?

The author states his conclusions but doesn't give the reader the information required to examine the problem.

- Whether the article to be summarized fits into the tested GPT model's context size

- The prompt

- The number of attempts

- He doesn't always state which information in the summary, specifically, is missing or wrong

For example: "I first tried to let ChatGPT one of my key posts (...). ChatGPT made a total mess of it. What it said had little to do with the original post, and where it did, it said the opposite of what the post said." He doesn't say which statements of the original article were reproduced falsely by ChatGPT.

My experience is that ChatGPT 4 is good when summarizing articles, and extremely helpful when I need to shorten my own writing. Recently I had to write a grant application with a strict size limit of 10 pages, and ChatGPT 4 helped me a lot by skillfully condensing my chapters into shorter texts. The model's understanding of the (rather niche) topic was very good. I never fed it more than about two pages of text at once. It also adopted my style of writing to a sufficient degree. A hypothetical human who'd have to help on short notice probably would have needed a whole stressful day to do comparable work.

kibbi commented on Return-to-Office Mandates Aren't Worth the Talent Risks gartner.com/en/articles/t... · Posted by u/alexzeitler

kibbi · a year ago

I prefer to work intensely and collaboratively in an office.

This is how I'd do it: Three in-office days, same weekdays for everyone (e.g. Monday to Wednesday). The choice to have a 5-day or 4-day week.

An energetic, quietly humming work atmosphere, with incidental information sharing and a spirit of collaboration, with colleagues present and nearby, sounds best for me personally. Among other advantages, the presence of coworkers helps me focus.

Different strokes for different folks. Obviously, the prerequisite is that it's a team of nice people you like being around.

Sure, people who prefer to work from home would leave that company. That doesn't mean that this company will lack talent. People who do want to work like that will join it.