Hierarchical Policy Design for Sample-Efficient Learning of Robot Table Tennis Through Self-Play

Model-Based Land-Ball Controller

No forward search.

Forward search using Cross-Entropy Method (CEM)