bridge.mp4
The reconstruction quality on the water surface, road surface, and rooftops is significantly superior to other methods.
bridge2.mp4
The reconstruction results of different methods are similar, with slightly better performance than the baseline methods (UnModNet) in reconstructing the sky (clouds) and rooftops.
hallway.mp4
The reconstruction results of different methods are similar, with a slight improvement over the baseline method (UnModNet) in the reconstruction of the wall at the beginning of the video.
hallway2.mp4
The reconstruction results of all methods are suboptimal (with noticeable distortions visible to the naked eye), but SSViT achieves significantly better reconstruction of the sky and the wall compared to other methods.
exhibition_area_light.mp4
The reconstruction quality of different methods is similar, with a slight improvement over the baseline method (UnModNet) in recovering the patterns on the windows.
exhibition_area_dark.mp4
The reconstruction quality of all methods is suboptimal (with noticeable distortions visible to the naked eye), but SSViT achieves significantly better reconstruction of people and clothing compared to other methods.
exhibition_area_combined.mp4
When connecting the bright and dark regions, our method demonstrates superior temporal consistency.
river.mp4
The reconstruction quality of different methods is similar, but SSViT outperforms the others in recovering water ripples.