Oh boy, Meta just open-sourced a few models which actually seem kinda wild, the big ones (for us) being:
MusicGen generates music from text-based user inputs. All of the generic background music you could ever want is available on-demand, with plenty of samples here.
AudioGen generates sound effects from text-based user inputs. Find examples here of things like "A duck quacking as birds chirp and a pigeon cooing," which absolutely is as advertised.
In the same way stock photography is being eaten by AI, foley) is up next.