Consistency in Video Generative Models: from Clip to Wild (CVM @ AAAI '26)