MLA: A Multisensory LanguageAction Model for Multimodal Integration and Forecasting in Robotic Manipulation