Toward Learning to Detect and Predict Contact Events on Vision-based Tactile Sensor