iSUMA:  Improving Scene Understanding with multiple sensor Modalities and Active perception