Pick a genre and a decade to train an LSTM. If there just isn't enough data, use the entire corpus. Perhaps instead of this, use transfer learning as per Slack...