Since our launch ten months ago, my cofounder and I have continued to improve our music model significantly. You can listen to some cool Staff Picks songs from the latest version here https://sonauto.ai/ , listen to an acapella song I made for my housemate here https://sonauto.ai/song/8a20210c-563e-491b-bb11-f8c6db92ee9b , or try the free and unlimited generations yourself.
However, given there are only two of us right now competing in the "best model and average user UI" race we haven't had the time to build some of the really neat ideas our users and pro musicians have been dreaming up (e..g, DAW plugins, live performance transition generators, etc). The hacker musician community has a rich history of taking new tech and doing really cool and unexpected stuff with it, too.
As such, we're opening up an API that gives full access to the features of our underlying diffusion model (e.g., generation, inpainting, extensions, transition generation, inverse sampling). Here are some things our early test users are already doing with it:
- A cool singing-to-video model by our friends at Lemon Slice: https://x.com/LemonSliceAI/status/1894084856889430147 (try it yourself here https://lemonslice.com/studio)
- Open source wrapper written by one of our musician users: https://github.com/OlaFosheimGrostad/networkmusic
- You can also play with all the API features via our consumer UI here: https://sonauto.ai/create
We also have some examples written in Python here: https://github.com/Sonauto/sonauto-api-examples
- Generate a rock song: https://github.com/Sonauto/sonauto-api-examples/blob/main/ro...
- Download two songs from YouTube (e.g., Smash Mouth to Rick Astley) and generate a transition between them: https://github.com/Sonauto/sonauto-api-examples/blob/main/tr...
- Generate a singing telegram video (powered by ours and also Lemon Slice's API): https://github.com/Sonauto/sonauto-api-examples/blob/main/si...
You can check out the full docs/get your key here: https://sonauto.ai/developers
We'd love to hear what you think, and are open to answering any tech questions about our model too! It's still a latent diffusion model, but much larger and with a much better GAN decoder.
For instance a guitarist will have a track they wish they had vocals for(and lyrics) for and if they could pay for that they would.
Literally if you could highlight a tune section in your DAW, prompt it, and vocals + lyrics were generated, possibly different version or harmonies for existing parts etc. Musicians already pay for plugins but the singing ones are awful to use so far.
I probably would be happy paying a service I could drop a riff into and get decent drum track that goes with it. Even more would be while recording or playing it modifies and adapts, it can be recorded and clipped. Something that fits a clean workflow. If anyone makes this please don't make it such a pain as most VSTs and plugin systems where there are like 4 different installers and licensing software layers.
The other very real aspect here is "training data" has to come from somewhere, and the copyright implications of this are beyond solved.
In the past I worked on real algorithmic music composition: algorithmic sequencer, paired with hardware- or soft- synthesizers. I could give it feedback and it'd evolve the composition, all without training data. It was computationally cheap, didn't infringe anyone's copyright, and a human still had very real creative influence (which instruments, scale, tempo, etc.). Message me if anyone's still interested in "dumb" AI like that. :-)
Computer-assisted music is nothing new, but taking away the creativity completely is turning music into noise -- noise that sounds like music.
The reason is greed. They jump on the bandwagon to get rich, not to bring art. They don't care about long term effects on creativity. If it means that it kills motivation to create new music, or even learn how to play an instrument, that's fine by these people. As long as they get their money.
Not sure how to reach out, but I'm definitely interested in reading about procedural methods in music synthesis. Any links describing your approach?
Actually, noise that sounds like music is some of the best music there is: electroacoustic music.
A lot better than most music on the radio. ;-)
I don't see any contact info in your profile, but I have an email in mine. I am interested in hearing more about your process and if you have music for sale anywhere, I like to support electronic artists doing interesting stuff.
It’s less than worthless.
How is that specific to text prompting? If you tap your fingers to a model and it generates a song from your tapping, it's still just fitting the training data as you say.
Deleted Comment
Here are a few of my songs, I think they are fairly consistent?
https://sonauto.ai/song/e2e3d210-69b4-4ad7-96d1-fb5744d0c648
https://sonauto.ai/song/a94e04a9-7b74-4b87-b5ed-ca3e8d2798d0
https://sonauto.ai/song/55a36595-c60a-4346-81d8-6f03ebe690ff
8. OUTPUT As between You and the Services, and to the extent permitted by applicable law, You own any right, title, or interest that may exist in the musical and/or audio content that You generate using the Services ("Outputs"). We hereby assign to You all our right, title, and interest, if any, in and to Your Outputs. This assignment does not extend to other users' Outputs, regardless of similarity between Your Outputs and their Outputs. You grant to us an unrestricted, unlimited, irrevocable, perpetual, non-exclusive, transferable, royalty-free, fully-paid, worldwide license to use Your Output to provide, maintain, develop, and improve the Services, to comply with applicable law, and/or to enforce our terms and policies. You are solely responsible for Outputs and Your use of Outputs, including ensuring that Outputs and Your use thereof do not violate any applicable law or these terms of service. We make no warranties or representations regarding the Outputs, including as to their copyrightability or legality. By using the Services, You warrant that You will use Outputs only for legal purposes.
You own the rights, but Sonauto is granted the rights to use it as well.
>We hereby assign to You all our right, title, and interest, if any
>You are solely responsible for Outputs and Your use of Outputs
I love how it clearly laid out the scenario that the right don't exist, yet you are responsible.
Error creating billing portal Failed to create billing portal session: No configuration provided and your live mode default configuration has not been created. Provide a configuration or create your default by saving your customer portal settings in live mode at https://dashboard.stripe.com/settings/billing/portal.
Also when trying to update my profile picture :
Failed to update image! column users.current_period_end does not exist