Speech-to-Singing Conversion in an Encoder-Decoder Framework — arXiv2