
highplainsdem

(60,016 posts)
Thu Sep 14, 2023, 08:16 AM

AI can now generate CD-quality music from text, and it's only getting better

https://arstechnica.com/information-technology/2023/09/ai-can-now-generate-cd-quality-music-from-text-and-its-only-getting-better/

AI can now generate CD-quality music from text, and it’s only getting better
Musicians: Speak now or forever hold your beats.

BENJ EDWARDS - 9/13/2023, 3:59 PM


Imagine typing "dramatic intro music" and hearing a soaring symphony or writing "creepy footsteps" and getting high-quality sound effects. That's the promise of Stable Audio, a text-to-audio AI model announced Wednesday by Stability AI that can synthesize music or sounds from written descriptions. Before long, similar technology may challenge musicians for their jobs.

-snip-

Now Stability and Harmonai want to break into commercial AI audio production with Stable Audio. Judging by production samples, it seems like a significant audio quality upgrade from previous AI audio generators we've seen.

-snip-

To train its model, Stability partnered with stock music provider AudioSparx and licensed a data set "consisting of over 800,000 audio files containing music, sound effects, and single-instrument stems, as well as corresponding text metadata." After feeding 19,500 hours of audio into the model, Stable Audio knows how to imitate certain sounds it has heard on command because the sounds have been associated with text descriptions of them within its neural network.

-snip-

As it stands, it's looking like we might be on the edge of production-quality AI-generated music with Stable Audio, considering its audio fidelity. Will musicians be happy if they get replaced by AI models? Likely not, if history has shown us anything from AI protests in the visual arts field. For now, a human can easily outclass anything AI can generate, but that may not be the case for long. Either way, AI-generated audio may become another tool in a professional's audio production toolbox.

-snip-



Stability.ai is, of course, the company behind the image generator Stable Diffusion, and it has been sued by artists over it. The company did not have the rights to the images used to train Stable Diffusion. They say they have a licensed data set this time.

Stability.ai announcement, with some audio samples: https://stability.ai/research/stable-audio-efficient-timing-latent-diffusion

And yes, I did register and try Stable Audio, using keywords that would fit a type of music I love.

I was hoping the results would not be as good as the article suggested. The AI music generated from text prompts that I'd heard in the past wasn't as good. Google also realized there were huge copyright concerns with its MusicLM AI - https://www.democraticunderground.com/103492667 - and so far has released only a test model, offered in part so that users can help train the AI by choosing the better of two generated options.

The music sample I got from my text prompt (only one was generated) took longer to generate than typical AI text and images. I didn't time it, but I think it took over a minute, though that's probably due in part to high demand already. Still, it was much better than what I'd heard from other AI music generators.

It wouldn't have sounded bad in my favorite blues bar.

And no, I'm not going to tell you what prompt I used. The way generative AI works, you get different results from identical prompts (which is why Google wants users to help train their AI by choosing one of two options, and generative AI companies often give you 4 options simultaneously).

A few days ago I posted about Queen's Brian May and his concerns about AI: https://www.democraticunderground.com/100218263791

Stable Audio will, if anything, make him more apprehensive.

Damn damn damn...

emulatorloo

(46,135 posts)
2. That drum loop is repetitive as hell. At least they are using licensed music from a stock music
Thu Sep 14, 2023, 08:26 AM

company to train it, rather than artists. It definitely sounds like stock music to me.

To train our flagship Stable Audio model, we used a dataset consisting of over 800,000 audio files containing music, sound effects, and single-instrument stems, as well as corresponding text metadata, provided through a deal with stock music provider AudioSparx. This dataset adds up to over 19,500 hours of audio.






highplainsdem

(60,016 posts)
4. Rick Beato always has a lot to say on YouTube about how repetitive current hits
Thu Sep 14, 2023, 08:36 AM

often are.

Nothing like classic rock...

Elessar Zappa

(16,385 posts)
7. I'm a trained musician and I can tell you
Thu Sep 14, 2023, 08:56 AM

that most top 40 pop music has been musically extremely repetitive since at least the 50s (and probably long before). It might be a little worse now but not by much. People essentially like to hear the same chord progressions, drum loops, bass lines, etc. over and over again. There is a lot of fairly original music out there now, just as there was back in the 60s and 70s, but in general you won’t hear it on the radio.

 

newdayneeded

(2,493 posts)
11. I heard Pharrell Williams' song, Happy,
Thu Sep 14, 2023, 09:06 AM

for the first time in years. I swear it's just the same chorus for 3 1/2 minutes!

highplainsdem

(60,016 posts)
5. Btw, the sample I gave them a prompt for sounded better, to me, than the
Thu Sep 14, 2023, 08:38 AM

audio samples in their announcement.

Hugin

(37,440 posts)
8. I'm curious whether what the AI is doing is stitching together MIDI tracks.
Thu Sep 14, 2023, 09:00 AM

Because if that’s so, the source music was originally played by humans. That would also explain why the “quality” is so good.

So, in essence, this is exactly the same functionality that chatbots use when stitching together text replies from a huge reservoir of Internet scrapings. Except in this case they’ve upped the exploitation of the talents of the human studio musicians who provided the source.

Elessar Zappa

(16,385 posts)
9. If I know something is created by AI,
Thu Sep 14, 2023, 09:01 AM

I won’t listen to it even if it’s good. I want to support actual human artists. Same with tv/movie scripts.

 

GenXer47

(1,204 posts)
10. perhaps this will finally weed out all the hacks with cucumbers in their trousers
Thu Sep 14, 2023, 09:03 AM

As a jazz musician who used to practice like an Olympian, I had to accept long ago that the music industry/profession/hobbyland is full to the brim with lazy, untalented hacks who just want to show off and maybe get laid.
And, I had to grasp that this profession is completely useless from a utilitarian point of view. We could survive (blandly) without it.
So, when I hear that AI can out-compose pretty much anyone, the lemonade from this lemon could be thirst-quenching for jazz musicians, who specialize in the love and beauty of live improvisation with other human beings. THAT is what the audience is there for - to passively participate in the incredible intimacy of two or more human beings creating harmony and dissonance without ever discussing it beforehand - to the point that we can predict what each other will play. I've "made love" to hundreds of other musicians in this way and it's a feeling on par with actual sex, or perhaps landing a jumbo jet full of human souls.
AI will scorch the landscape of musical hacks/attention seekers, and finally open up the space for those of us who "get it".

Hugin

(37,440 posts)
12. This is an interesting take on what is happening...
Thu Sep 14, 2023, 09:26 AM

I can’t say I disagree with any of your points.

I will say what is going on is a little deeper, though.

Let’s say someone is recognized as the best and most talented at performing with a particular instrument. How this works is that a third party lifts that performer's riffs, with or without compensation, and loads them into a machine which presents the riffs custom mixed on demand. In essence, that performer is competing against themselves.

It's a situation lamented by many, as it occurs even now with musicians trying to change their lineup and getting thwarted by recordings of their earlier work that they no longer control.

My point being it’s still people with cucumbers in their pants who are benefiting. They are simply different people and arguably less deserving.

tinrobot

(11,952 posts)
13. Haven't we been hearing about "formulaic music" for decades?
Thu Sep 14, 2023, 09:56 AM

People have been writing to a formula since the Brill Building was a thing.

Brian Eno used to compose music by throwing dice.

And people have been using computer algorithms to compose music since at least the 1980s, when MIDI became a thing.

The only thing new about this is the algorithm.

 

Chainfire

(17,757 posts)
14. It is all a case of mind over matter; if you don't mind, it doesn't matter.
Thu Sep 14, 2023, 12:44 PM

An automobile will never replace my horse!
