Video Object Self-Annotation