Regularity as Intrinsic Reward for Free Play
Cansu Sancaktar, Justus Piater and Georg Martius
Cansu Sancaktar, Justus Piater and Georg Martius
Optimizing for RaIR with Ground Truth (GT) models gives us regular and symmetric patterns/constellations!
Some snapshots from free play
Iteration 271
Iteration 285
Iteration 292
Iteration 295
Quadruped
Walker
Free play iteration 110
Free play iteration 150
Free play iteration 165
Free play iteration 172
(Videos are RaIR + CEE-US solving the tasks)
Balance Front
Stand Rotated
Attack
Balance Back
Free play iteration 150
Free play iteration 175
Free play iteration 180
Free play iteration 190
(Videos are RaIR + CEE-US solving the tasks)
Stack Cube + Ball
Stack Cube + Flat Block