Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning a Latent Space of Multitrack Measures (1806.00195v1)

Published 1 Jun 2018 in stat.ML, cs.LG, cs.SD, and eess.AS

Abstract: Discovering and exploring the underlying structure of multi-instrumental music using learning-based approaches remains an open problem. We extend the recent MusicVAE model to represent multitrack polyphonic measures as vectors in a latent space. Our approach enables several useful operations such as generating plausible measures from scratch, interpolating between measures in a musically meaningful way, and manipulating specific musical attributes. We also introduce chord conditioning, which allows all of these operations to be performed while keeping harmony fixed, and allows chords to be changed while maintaining musical "style". By generating a sequence of measures over a predefined chord progression, our model can produce music with convincing long-term structure. We demonstrate that our latent space model makes it possible to intuitively control and generate musical sequences with rich instrumentation (see https://goo.gl/s2N7dV for generated audio).

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Ian Simon (16 papers)
  2. Adam Roberts (46 papers)
  3. Colin Raffel (83 papers)
  4. Jesse Engel (30 papers)
  5. Curtis Hawthorne (17 papers)
  6. Douglas Eck (24 papers)
Citations (51)

Summary

We haven't generated a summary for this paper yet.