Multimodal Pretraining Under Sensor Diversity: Lessons Learned from Foundation Models for Earth and Mars