
Polyphonic Music Generation with Sequence Generative Adversarial Networks (1710.11418v2)

Published 31 Oct 2017 in cs.SD and eess.AS

Abstract: We propose an application of sequence generative adversarial networks (SeqGAN), which are generative adversarial networks for discrete sequence generation, to creating polyphonic musical sequences. Instead of the monophonic melody generation suggested in the original work, we present an efficient representation of a polyphonic MIDI file that simultaneously captures chords and melodies with dynamic timings. The proposed method condenses the duration, octaves, and keys of both melodies and chords into a single word vector representation, and recurrent neural networks learn to predict distributions of sequences from the embedded musical word space. We experiment with both the original method and the least squares method applied to the discriminator; the latter is known to stabilize the training of GANs. The network can create sequences that are musically coherent and shows improved quantitative and qualitative measures. We also report that careful optimization of the reinforcement learning signals of the model is crucial for general application of the model.
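The idea of condensing duration, octaves, and keys of a melody and a chord into a single word can be illustrated with a small sketch. This is not the authors' encoding: the field names, the `Event` type, and the vocabulary sizes in `SIZES` are all hypothetical, chosen only to show how several discrete musical attributes might be packed into one token ID that a sequence model can embed.

```python
from math import prod
from typing import NamedTuple


class Event(NamedTuple):
    """One time step of a polyphonic sequence (hypothetical field layout)."""
    duration: int       # index into an assumed table of note lengths
    melody_octave: int  # assumed octave bucket for the melody note
    melody_key: int     # pitch class of the melody note, 0-11
    chord_octave: int   # assumed octave bucket for the chord root
    chord_key: int      # pitch class of the chord root, 0-11


# Assumed number of distinct values per field, in the same order as Event.
SIZES = (8, 4, 12, 4, 12)
VOCAB_SIZE = prod(SIZES)


def encode(event: Event) -> int:
    """Pack all fields into a single token ID using mixed-radix positions."""
    token = 0
    for value, size in zip(event, SIZES):
        if not 0 <= value < size:
            raise ValueError(f"field value {value} outside range [0, {size})")
        token = token * size + value
    return token


def decode(token: int) -> Event:
    """Recover the original fields from a token ID."""
    if not 0 <= token < VOCAB_SIZE:
        raise ValueError(f"token {token} outside range [0, {VOCAB_SIZE})")
    values = []
    for size in reversed(SIZES):
        token, value = divmod(token, size)
        values.append(value)
    return Event(*reversed(values))
```

Under these assumptions every combination of fields maps to exactly one integer below `VOCAB_SIZE`, so a generator can treat a polyphonic time step as one discrete word, much like a token in text generation.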

Authors (4)
  1. Sang-gil Lee (15 papers)
  2. Uiwon Hwang (14 papers)
  3. Seonwoo Min (10 papers)
  4. Sungroh Yoon (163 papers)
Citations (18)
