SilhoueTTS: Reference Guided Text to Speech using Prosody Features