Readit News logoReadit News
sandslides commented on New evidence that Cantor plagiarized Dedekind?   quantamagazine.org/the-ma... · Posted by u/rbanffy
dkarl · 13 days ago
> In their 1872 papers, though, Cantor and Dedekind had found a way to construct a number line that was complete. No matter how much you zoomed in on any given stretch of it, it remained an unbroken expanse of infinitely many real numbers, continuously linked.

> Suddenly, the monstrosity of infinity, long feared by mathematicians, could no longer be relegated to some unreachable part of the number line. It hid within its every crevice.

I'm vaguely familiar with some of the mathematics, but I have no idea what this is trying to say. The infinity of the rational numbers had been known a thousand years prior by the Greeks, including by Zeno whom the article already mentioned. The Greeks also knew that some quantities could not be expressed as rational numbers.

I would assume the density of irrational numbers was already known as well? Give x < y, it's easy to construct x + (y-x)(sqrt(2))/2.

I don't get what "suddenly" became apparent.

sandslides · 13 days ago
could I just leave my favourite thing ever here? thanks :)

https://en.wikipedia.org/wiki/Hilbert%27s_paradox_of_the_Gra...

sandslides commented on VoiceCraft: Open-source high quality voice cloning / voice editing   github.com/jasonppy/Voice... · Posted by u/sandslides
sandslides · 2 years ago
The model weights have been uploaded to Huggingface : https://huggingface.co/pyp1/VoiceCraft

This seems to be really high quality judging by the demo's. Not had time to try it for myself

Demos : https://jasonppy.github.io/VoiceCraft_web/

sandslides commented on StyleTTS2 – open-source Eleven-Labs-quality Text To Speech   github.com/yl4579/StyleTT... · Posted by u/sandslides
eigenvalue · 2 years ago
I tried it. Sounds absolutely nothing like my voice or my wife's voice. I used the same sample files as I used 2 days ago on the Eleven Labs website, and they worked flawlessly there. So this is very, very far from being close to "Eleven Labs quality" when it comes to voice cloning.
sandslides · 2 years ago
The speech generated is the best I've heard from an open source model. The one test I made didn't make an exact clone either but this is still early days. There's likely something not quite right. The cloned voice does speak without any artifacts or other weirdness that most TTS systems suffer from.
sandslides commented on StyleTTS2 – open-source Eleven-Labs-quality Text To Speech   github.com/yl4579/StyleTT... · Posted by u/sandslides
eigenvalue · 2 years ago
Was somewhat annoying to get everything to work as the documentation is a bit spotty, but after ~20 minutes it's all working well for me on WSL Ubuntu 22.04. Sound quality is very good, much better than other open source TTS projects I've seen. It's also SUPER fast (at least using a 4090 GPU).

Not sure it's quite up to Eleven Labs quality. But to me, what makes Eleven so cool is that they have a large library of high quality voices that are easy to choose from. I don't yet see any way with this library to get a different voice from the default female voice.

Also, the real special sauce for Eleven is the near instant voice cloning with just a single 5 minute sample, which works shockingly (even spookily) well. Can't wait to have that all available in a fully open source project! The services that provide this as an API are just too expensive for many use cases. Even the OpenAI one which is on the cheaper side costs ~10 cents for a couple thousand word generation.

sandslides · 2 years ago
The LibriTTS demo clones unseen speakers from a five second or so clip
sandslides commented on StyleTTS2 – open-source Eleven-Labs-quality Text To Speech   github.com/yl4579/StyleTT... · Posted by u/sandslides
progbits · 2 years ago
> MIT license

> Before using these models, you agree to [...]

No, this is not MIT. If you don't like MIT license then feel free to use something else, but you can't pretend this is open source and then attempt to slap on additional restrictions on how the code can be used.

sandslides · 2 years ago
Yes, I noticed that. Doesn't seem right does it
sandslides commented on StyleTTS2 – open-source Eleven-Labs-quality Text To Speech   github.com/yl4579/StyleTT... · Posted by u/sandslides
fullstackchris · 2 years ago
Great stuff, took a look through the README but... what are the minimum hardware requirements to run this? Is this gonna blow up my CPU / harddrive?
sandslides · 2 years ago
Not sure. The only inference demos are colab notebooks. The models are approx 700mb each so I imagine it will run on modest gpu
sandslides commented on StyleTTS2 – open-source Eleven-Labs-quality Text To Speech   github.com/yl4579/StyleTT... · Posted by u/sandslides
sandslides · 2 years ago
Just tried the collab notebooks. Seems to be very good quality. It also supports voice cloning.

u/sandslides

KarmaCake day291November 19, 2023View Original