Improved Video-Driven Speech Reconstruction