Interactive 3D Annotation of Objects in Moving Videos from Sparse Multi-view Frames