Creating Latent Spaces for Modern Music Genre Rhythms Using Minimal Training Data


International Conference on Computational Creativity

Gabriel Vigliensoni, Louis McCallum, and Rebecca Fiebrink. 2020. In Proceedings of the 11th International Conference on Computational Creativity (ICCC’20).

In this paper, we present research on customizing a variational autoencoder (VAE) neural network to play with musical rhythms encoded within a latent space. To enable customization and personalization, the network can be trained with as few as one dozen MIDI clips with rhythms. Additionally, our approach employs a data structure that is capable of encoding rhythms with binary, ternary, or a combined metrical grid. A metrical grid can be explained as the main ratio by which the onsets of notes are placed in a measure. Music where the beats are split in two—a binary grid—is in simple meter. Music where the beats are split in three—a ternary grid—is in compound meter. Many modern music genres, such as footwork, gqom, dembow, or trap, can be characterized by rhythmic elements using a compound meter. In these rhythms, the main meter is usually simple but there are elements that are placed on an overlaid ternary grid. To the best of our knowledge, this work is the first time that a network architecture has been used to encode rhythms that exhibit a combined binary and ternary grid.