MERL: Multimodal Event Representation Learning in Heterogeneous Embedding Spaces