Drysdale, J., Tomczak, M., Hockman, J., 2021.
Style-based drum synthesis with GAN inversion
Output Type: | Conference paper |
Presented at: | 22nd International Society of Music Information Retrieval Conference |
Publication: | Extended Abstracts for the Late-Breaking Demo Session of the 22nd Int. Society for Music Information Retrieval Conference |
Venue: | Online |
Publisher: | International Society for Music Information Retrieval |
Dates: | 7/11/2021 - 12/11/2021 |
URL: | ismir2021.ismir.net/lbd |
Neural audio synthesizers exploit deep learning as an alternative to traditional synthesizers that generate audio from hand-designed components such as oscillators and wavetables. For a neural audio synthesizer to be applicable to music creation, meaningful control over the output is essential. This paper provides an overview of an unsupervised approach to deriving useful feature controls learned by a generative model. A system for generation and transformation of drum samples using a style-based generative adversarial network (GAN) is proposed. The system provides functional control of audio style features, based on principal component analysis (PCA) applied to the intermediate latent space. Additionally, we propose the use of an encoder trained to invert input drums back to the latent space of the pre-trained GAN. We experiment with three modes of control and provide audio results on a supporting website.