View a PDF of the paper titled Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion, by Yujia Huang and 8 other authors
Abstract:We study the problem of symbolic music generation (e.g., generating piano rolls), with a technical focus on non-differentiable rule guidance. Musical rules are often expressed in symbolic form on note characteristics, such as note density or chord progression, many of which are non-differentiable which pose a challenge when using them for guided diffusion. We propose Stochastic Control Guidance (SCG), a novel guidance method that only requires forward evaluation of rule functions that can work with pre-trained diffusion models in a plug-and-play way, thus achieving training-free guidance for non-differentiable rules for the first time. Additionally, we introduce a latent diffusion architecture for symbolic music generation with high time resolution, which can be composed with SCG in a plug-and-play fashion. Compared to standard strong baselines in symbolic music generation, this framework demonstrates marked advancements in music quality and rule-based controllability, outperforming current state-of-the-art generators in a variety of settings. For detailed demonstrations, code and model checkpoints, please visit our project website: this https URL.
Submission history
From: Yujia Huang [view email]
[v1]
Thu, 22 Feb 2024 04:55:58 UTC (1,758 KB)
[v2]
Fri, 23 Feb 2024 02:15:32 UTC (1,758 KB)
[v3]
Mon, 3 Jun 2024 02:47:27 UTC (1,760 KB)
[v4]
Wed, 25 Sep 2024 03:12:27 UTC (1,760 KB)
Source link
lol