Each time-step represents one-half of a second. The training is carried out on a 64 x 64 spatial grid.
*** The source code for reproducing the results presented in this paper will be made publicly available upon acceptance ***
Pre-training is carried out on a spatial grid of size 256.
(Train on a given resolution and test on a different resolution.)
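
One way to set up a train-on-one-resolution, test-on-another experiment is to subsample a single fine-grid dataset at two different strides. The following is a minimal sketch under stated assumptions, not the authors' pipeline: the helper `subsample`, the random placeholder data, the 256 x 256 fine grid, and the 128 x 128 test resolution are all hypothetical choices made for illustration. Only the 64 x 64 training grid is taken from the text.

```python
import numpy as np


def subsample(field: np.ndarray, target: int) -> np.ndarray:
    """Strided subsampling of the last two (spatial) axes to target x target.

    Hypothetical helper: keeps every (n // target)-th grid point, so the
    coarse grid is a subset of the fine grid.
    """
    n = field.shape[-1]
    if field.shape[-2] != n or n % target != 0:
        raise ValueError("expected a square grid whose size is a multiple of target")
    stride = n // target
    return field[..., ::stride, ::stride]


# Placeholder data (assumption): 4 snapshots on a 256 x 256 fine grid.
rng = np.random.default_rng(0)
fine = rng.standard_normal((4, 256, 256))

train_fields = subsample(fine, 64)   # 64 x 64, the training grid stated in the text
test_fields = subsample(fine, 128)   # a different resolution (assumed) for testing

print(train_fields.shape, test_fields.shape)
```

Strided subsampling is the simplest choice because both grids sample the same underlying field at shared points. Averaging or spectral truncation are alternatives if aliasing is a concern.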