Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks


Alex Quach*, Makram Chahine*, Alexander Amini, Ramin Hasani, Daniela Rus
CSAIL, MIT

* equal contribution

Overview

We demonstrate Sim-to-Real, zero-shot transfer to a real world quadrotor drone using liquid neural networks. Our liquid network policies, trained on single-step maneuvers collected from only indoor, photorealistic-simulated flight, generalize to multi-step hikes onboard a real hardware platform in outside environments. This work presents a sample efficient method to improve generalization and robustness to distribution shifts in Sim-to-Real visual quadrotor navigation tasks.

Training Dataset (Videos + Velocity Labels): https://zenodo.org/records/10966588

Gaussian Splatting Capture of Real-World Environment: https://zenodo.org/records/10723785

GitHub: https://github.com/alexquach/multienv_sim

Training:

Gaussian Splatting (GS), indoor environment, with single-step maneuvers

The Gaussian Splatting environment is used to render images and the PyBullet simulator is used to calculate dynamics. The task is to approach the target, then turn left if the ball is red-colored or turn right if the ball is blue-colored. Notably, we use a grainy simulation with cloudy artifacts for generating training data to demonstrate generalization ability. More details on hybrid simulation environment, task description, and trajectory design can be found in Section IV.

train_d6_ss2_600_1_10hzf_bm_px_td_nlsp_gn_nt.mp4

Video of first 20 examples of the total dataset (n=600)

Evaluation:

In-Distribution + Two-step Hike: GS, indoor environment

Liquid (left) video shows the ability of liquid policies to complete a composition of two targets, while the LSTM (right) policies struggle. 

cl_realgs_d6_nonorm_ss2_600_1_10hzf_bm_px_td_nlsp_gn_nt_srf_300sf_irreg2_64_hyp_cfc_mleno_3hz_05sf_100act_doub_emergency.mp4

Liquid

cl_realgs_d6_nonorm_ss2_600_3hzf_bm_px_td_nlsp_gn_nt_srf_300sf_irreg2_64_hyp_lstm_mleno_3hz_05sf_100act.mp4

LSTM

Videos of 10 examples from the total evaluation (n=100). Quantitative GS-to-GS results at Table I in Section V.C.2

GS-to-Real: Transfer to Real World, indoor environment:

Videos show the Liquid policies locking into the target with more stability than the LSTM policy. Turning before approaching is a common mistake with the LSTM policy.

holodeck_liquid_apaper.mp4

Liquid

holodeck_lstm_apaper.mp4

LSTM

Videos of 8 examples from the total evaluation (n=40). Quantitative GS-to-Real results at Table II in Section V.C.3

goodholo.mp4

Additional indoor examples, showing two-step hikes.

GS-to-Real + Generalization: Transfer to Real World, outdoor environment:

Video shows effective transfer of Liquid policies to a different scene (outdoors).

lawn_cfc110_apaper.mp4

Liquid

Videos of examples from the total evaluation (n=40). Quantitative GS-to-Real + Generalization results at Table III in Section V.C.4

goodlawn.mp4

Additional outdoor examples, where shadows are cast over the targets.

GS-to-Real + Generalization + Five-step Hike: Transfer to Real World, outdoor environment:

(Same as overview video)

Training:

Only PyBullet simulation environment, with single-step maneuvers

Only the PyBullet simulator is used to render and calculate dynamics. More details on hybrid simulation environment, task description, and trajectory design can be found in Section IV.

train_d6_ss2_600_1_10hzf_bm_px_td_nlsp_gn_nt_pyb.mp4

Video of first 20 examples of the total dataset (n=600)

Evaluation:

In-Distribution + 100-Step Hike: PyBullet environment

Our Liquid model successfully achieves the 100-step hike, but the LSTM model fails after the 3rd step. Both videos are at the same speed, demonstrating the relative hesitation of the LSTM model compared to the Liquid model.

cl_real_d6_nonorm_ss2_600_1_10hzf_bm_px_td_nlsp_gn_nt_pybullet_srf_300sf_irreg2_64_hyp_cfc_multi_vanish_60deg_100_sam_pnorm_2.mp4

Liquid

cl_real_d6_nonorm_ss2_600_3hzf_bm_px_td_nlsp_gn_nt_pybullet_srf_300sf_irreg2_64_hyp_lstm_multi_vanish_60deg_100_sam_pnorm_2.mp4

LSTM

Video of the best 100-step hike for both the Liquid and LSTM models (2nd Run in Fig. 5 in Section Section V.C.1).