Anomaly Recognition in surveillance videos using 3D convolutional network 

Publication :

           https://link.springer.com/article/10.1007/s11042-021-10570-3

     https://github.com/ramnamaqsood/Activity_Recognition-3DCNN

    https://drive.google.com/file/d/12YJy_ut4fP6ax_c3QZbx_QkczuT2WlQj/view?usp=sharing



Annotations of UCF Crime dataset:

You can find our frame level annotations of complete UCF Crime dataset after filling this form: https://docs.google.com/forms/d/e/1FAIpQLSf-qboNRohi7ZV6hdo0p8DH_mliGPwtERqo8OXlIM_Pd7y1Mg/viewform



Outline/Abstract :

  Anomalous activity recognition deals with identifying the patterns and events that vary from the normal stream. In a surveillance paradigm, these events range from abuse to fighting and road accidents to snatching, etc. Due to the sparse occurrence of anomalous events, anomalous activity recognition from surveillance videos is a challenging research task. The approaches reported can be generally categorized as handcrafted and deep learning-based. Most of the reported studies address binary classification i.e. anomaly detection from surveillance videos. But these reported approaches did not address other anomalous events e.g. abuse, fight, road accidents, shooting, stealing, vandalism, and robbery, etc. from surveillance videos. Therefore, this paper aims to provide an effective framework for the recognition of different real-world anomalies from videos. This study provides a simple, yet effective approach for learning spatiotemporal features using deep 3-dimensional convolutional networks (3D ConvNets) trained on the University of Central Florida (UCF) Crime video dataset. Firstly, the frame-level labels of the UCF Crime dataset are provided, and then to extract anomalous spatiotemporal features more efficiently a fine-tuned 3D ConvNets is proposed. Findings of the proposed study are twofold 1) There exist specific, detectable, and quantifiable features in UCF Crime video feed that associate with each other 2) Multiclass learning can improve generalizing competencies of the 3D ConvNets by effectively learning frame-level information of dataset and can be leveraged in terms of better results by applying spatial augmentation. The proposed study extracted 3D features by providing frame level information and spatial augmentation to a fine-tuned pre-trained model, namely 3DConvNets. Besides, the learned features are compact enough and the proposed approach outperforms significantly from state of art approaches in terms of accuracy on anomalous activity recognition having 82% AUC. 

Challenges

Motivation

WhatsApp Video 2019-12-23 at 10.00.13.mp4

Demo: Explosion Recognition in test Video

thesis_Report_final.pdf
defense_presentation.pptx




System Pipeline

Usefull Reads

[1]  "UCF-CRIME[online]," Available at :https://www.kaggle.com/mission-ai/crimeucfdataset, 2016."

[2] "Wang, T., & Snoussi, H. (2013, January). Histograms of optical flow orientation for abnormal events detection. In 2013 IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (PETS) (pp. 45-52). IEEE."

[3] "Farooq, M. U., Khan, N. A., & Ali, M. S. (2017). Unsupervised Video Surveillance for Anomaly Detection of Street Traffic. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 8(12), 270-275."

[4] "Sultani, W., Chen, C., & Shah, M. (2018). Real-world Anomaly Detection in Surveillance Videos. Center for Research in Computer Vision (CRCV), University of Central Florida (UCF)."

[5] "Tran, D., Bourdev, L., Fergus, R., Torresani, L., & Paluri, M. (2015). Learning spatiotemporal features with 3d convolutional networks. In Proceedings of the IEEE international conference on computer vision (pp. 4489-4497)."

[6] "Sabokrou, M., Fathy, M., Hoseini, M., & Klette, R. (2015). Real-time anomaly detection and localization in crowded scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops (pp. 56-62)."

[7] "Sabokrou, M., Fayyaz, M., Fathy, M., & Klette, R. (2017). Deep-cascade: cascading 3D deep neural networks for fast anomaly detection and localization in crowded scenes. IEEE Transactions on Image Processing, 26(4), 1992-2004."

Dr. Usama Ijaz Bajwa

Co-PI, Video Analytics lab, National Centre in Big Data and Cloud Computing,Program Chair (FIT 2019),HEC Approved PhD Supervisor,Assistant Professor & Associate Head of DepartmentDepartment of Computer Science,COMSATS University Islamabad, Lahore Campus, Pakistanwww.usamaijaz.comwww.fit.edu.pkJob ProfileGoogle Scholar Profile
Ramna MaqsoodEmail: ramna.maqsood@gmail.comGithub Profile: https://github.com/ramnamaqsood?tab=repositoriesAssociate Lecturer in Faculty of Information and Technology, University of Central Punjab, Lahore Pakistan