Improving robustness of spontaneous speech synthesis with linguistic speech regularization and pseudo-filled-pause insertion