Text2Video: Text-driven Talking-head Video Synthesis with Phoneme-Pose Dictionary