These displays illustrate multiple object tracking under different hierarchical motion structures. The title of each video denotes the structure of its motion tree: depth is the number of levels in the tree, and width(L) is the number of distinct clusters (i.e., object parts) at level L.
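To make the two quantities concrete, here is a minimal Python sketch that computes the depth and the per-level widths of a small tree. It is an illustration only, not the code used to generate the videos. The tree, its node names, and the function `tree_stats` are hypothetical, and the sketch assumes that the root sits at level 0 and counts toward the depth; the displays may use a different indexing convention.

```python
from collections import deque


def tree_stats(children, root):
    """Return (depth, widths) for a tree given as a parent -> children mapping.

    widths[L] is the number of nodes at level L.
    Assumption: the root is level 0 and is counted as one level of depth.
    """
    widths = {}
    queue = deque([(root, 0)])
    # Breadth-first traversal, counting the nodes seen at each level.
    while queue:
        node, level = queue.popleft()
        widths[level] = widths.get(level, 0) + 1
        for child in children.get(node, []):
            queue.append((child, level + 1))
    depth = len(widths)  # number of distinct levels visited
    return depth, widths


# Hypothetical motion tree: one root, two clusters, five leaf objects.
example_tree = {
    "root": ["A", "B"],
    "A": ["a1", "a2"],
    "B": ["b1", "b2", "b3"],
}

depth, widths = tree_stats(example_tree, "root")
print(depth, widths)
```

Under these assumptions the example tree has three levels, with one node at level 0, two at level 1, and five at level 2.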