Supertone's Shift offers real-time voice changing technology. It lets users immediately switch to any selected voice. Just pick a voice and begin speaking. Shift is suited for VTubers, content creators, and gamers, as well as anyone who wishes to accurately express their chosen persona's voice. Try out Supertone Shift now.
>> https://product.supertone.ai/shift
I would like some clarity on the Terms of Service clause 4:
> The content created using Supertone Shift remains your property. However, by using our Services, you grant Supertone a worldwide, non-exclusive, royalty-free license to use, reproduce, adapt, and display content solely for the purpose of operating and improving Supertone Shift. This license does not grant Supertone any rights to sell or distribute your content.
Does Supertone Shift need the user content in order to further improve the product during the beta period?
Or does it need the user content in normal operation (for example, running the conversion on remote servers vs local processing)?
I can see some hesitation from people if you're recording everything they say, and keeping that recording for an indefinite period of time.
I can appreciate that there may be a problem enforcing a "Don't use our product for evil" clause, if you can review usage.
The challenge here seems overwhelming.
That being said, I hate "remains your property" part. It's just fluff that changes nothing, but distracts from the following sentence.
"Remains your property" is not fluff at all, and explicitly disclaims any ownership of rights associated with content you post, and equivalently indemnifies users against any liability for re-posting or re-using content they posted here, which they'd potentially be exposed to if they were assigning copyright to the hosting platform rather than just granting a license.
we may need your data for some unspecified purpose ("AI model training") that we can't even dream of right now, so we'll just take all the rights
Commercial applications like Voice.Ai and Koe are real time and have celebrity and anime voices respectively.
The RVC ecosystem on GitHub has dozens of different real time open source voice changers. I haven't kept up with the SOTA, but they're incredible, fine tunable, and 100% local.
https://voice.ai
https://koe.ai
https://m.youtube.com/watch?v=zkaBK5erB2c
They all use like 50% of my cpu to get real time. I was able to get actual low latency with koi, but still massive cpu usage. And theres no community of models for it either.
Perhaps someone who really knows what theyre doing could optimize these open source models but its not me
Deleted Comment
I would love to test the technology without the risk of damaging my computer!
In fact, it was Supertone Shift's installer that prodded me to seek it out (I happened to find and install Shift a couple of weeks ago).
In this case, it needs admin permissions to install to `/Library/Application Support` as well as `/Library/Audio`.
It needs to restart in order for the HAL driver to be loaded (this provides the virtual audio interface for using the app with Teams, Zoom, etc.)
The preinstall/postinstall scripts simply handle the app's directory in Application Support.
I decided it was safe enough, and had some fun playing with it. It contacts what it claims are licensing servers (when it starts), and won't start without it. It wanted to keep contacting those servers constantly, but blocking its network access via Little Snitch didn't prevent it from functioning. The network traffic was in the single-digit kilobyte range, so I felt reasonably confident no audio data was being looted.
[0] https://mothersruin.com/software/SuspiciousPackage/
You: Why? I already own a 2002 Ford Escape...
I'm not trying to make fun of you, I think you actually have a unique and impressive perspective! I've always hated hearing my voice on answering machines, so if I could choose any voice I'd choose Chris Cornell or Morgan Freeman.
[1] https://audio.sunflower.industries
I know watermarks are never foolproof, but they may deter casual misuse.
So, from what I understand, I cannot use it and then upload the training video to Youtube. Or can I?
Interesting legal problem.