Spotify Interview Question

Describe how a transformer based TTS model works and reason about common issues