Inspired by reinforcement learning, the SeqGAN algorithm (Yu et al. 2017) was proposed to bypass the generator differentiation problem, which arises because sampling discrete tokens blocks gradients from the discriminator, by directly performing a policy gradient update on the generator. It was successfully applied to text and music generation.
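As a rough illustration of the policy gradient idea, the sketch below estimates the gradient of the expected reward for a categorical policy over three tokens without differentiating through the sampling step. It is a simplified single-token REINFORCE estimator, not the full SeqGAN procedure (which scores partial sequences using rollouts); the names `reinforce_gradient` and `toy_discriminator` and the reward values are hypothetical and chosen only for this example.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_gradient(logits, reward_fn, num_samples=1000, rng=None):
    """Monte Carlo estimate of d E[reward] / d logits for a categorical policy.

    Uses the score-function identity:
        grad = E[ reward(token) * d log p(token) / d logits ],
    where d log p(token) / d logits_k = 1{k == token} - p_k for a softmax policy.
    """
    rng = rng or random.Random(0)
    probs = softmax(logits)
    grad = [0.0] * len(logits)
    for _ in range(num_samples):
        # Sample a discrete token; no gradient flows through this step.
        token = rng.choices(range(len(logits)), weights=probs)[0]
        reward = reward_fn(token)
        for k in range(len(logits)):
            indicator = 1.0 if k == token else 0.0
            grad[k] += reward * (indicator - probs[k]) / num_samples
    return grad


def toy_discriminator(token):
    """Hypothetical stand-in for a discriminator score: token 2 looks 'real'."""
    return 0.9 if token == 2 else 0.1


logits = [0.0, 0.0, 0.0]
grad = reinforce_gradient(logits, toy_discriminator, num_samples=5000)
print(grad)
```

In this toy setting the estimated gradient is positive for the token the stand-in discriminator rewards and negative for the others, so a gradient ascent step on the logits would shift probability mass toward the rewarded token.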