(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Technology and Engineering Exploration (IJATEE)

ISSN (Print):2394-5443    ISSN (Online):2394-7454
Volume-9 Issue-95 October-2022
Full-Text PDF
Paper Title : An integrated framework for abnormal event detection and video summarization using deep learning
Author Name : G. Balamurugan and J. Jayabharathy
Abstract :

In the real-world modern environment, intelligent transportation systems and intelligent surveillance systems are considered to play an anchor role in facilitating security and safety to the human society. These surveillance systems are diversely utilized in most of the places ranging from the application of border security to a street monitoring system that closely observes the abnormal event occurrence on the road. The core aim of the work is to present a rich set of abnormal event videos for intelligent surveillance systems. Hence, potential abnormal event detection and considerable video summarization mechanism is needed with maximum accuracy and reduced complexities. In this paper, hybrid convolution neural network (CNN) and bi-directional long short-term memory (Bi-LSTM) based abnormal event detection model is employed with reduced complexity. This model includes convolutional neural network with a pre-trained model for extracting spatio-temporal features from each individual frame selected from a series of frames, which is then passed to multi-layer Bi-LSTM that possesses the capability of accurately classifying the abnormal events in complex surveillance scenes of the road. Hierarchical temporal attention-based long short-term memory (LSTM) encoder-decoder model is included in the fine grain process in order to attain a better video summarization that preserves video key information and attains an optimal storage. The experimental results with the functional parameters confirmed the maximized abnormal event frames with minimum complexity using video summarization method and the accuracy of the abnormal detection scheme, on par with the benchmarked approaches considered for investigation.

Keywords : Road surveillance system, Abnormal event detection, Bi-directional long short-term memory, ResNet-50, Video. summarization, Hierarchical temporal attention-based long short-term memory encoder-decoder model.
Cite this article : Balamurugan G, Jayabharathy J. An integrated framework for abnormal event detection and video summarization using deep learning . International Journal of Advanced Technology and Engineering Exploration. 2022; 9(95):1494-1507. DOI:10.19101/IJATEE.2021.875854.
References :
[1]Li B, Leroux S, Simoens P. Decoupled appearance and motion learning for efficient anomaly detection in surveillance video. Computer Vision and Image Understanding. 2021.
[Crossref] [Google Scholar]
[2]Gaikwad B, Karmakar A. Smart surveillance system for real-time multi-person multi-camera tracking at the edge. Journal of Real-Time Image Processing. 2021; 18(6):1993-2007.
[Crossref] [Google Scholar]
[3]Kalair K, Connaughton C. Anomaly detection and classification in traffic flow data from fluctuations in the flow–density relationship. Transportation Research Part C: Emerging Technologies. 2021.
[Crossref] [Google Scholar]
[4]Kolekar S, Gite S, Pradhan B, Kotecha K. Behavior prediction of traffic actors for intelligent vehicle using artificial intelligence techniques: a review. IEEE Access. 2021; 9:135034-58.
[Crossref] [Google Scholar]
[5]Meng Q, Shang B, Liu Y, Guo H, Zhao X. Intelligent vehicles trajectory prediction with spatial and temporal attention mechanism. IFAC-PapersOnLine. 2021; 54(10):454-9.
[Crossref] [Google Scholar]
[6]Chu W, Wuniri Q, Du X, Xiong Q, Huang T, Li K. Cloud control system architectures, technologies and applications on intelligent and connected vehicles: a review. Chinese Journal of Mechanical Engineering. 2021; 34(1):1-23.
[Crossref] [Google Scholar]
[7]Nittayasoot N, Peterson AB, Thammawijaya P, Parker EM, Sathawornwiwat A, Boonthanapat N, et al. Evaluation of a hospital-based injury surveillance system for monitoring road traffic deaths in Phuket, Thailand. Traffic Injury Prevention.2019; 20(4):365-71.
[Crossref] [Google Scholar]
[8]Balamurugan G, Jayabharathy J. A comparative analysis of event detection and video summarization. In ambient communications and computer systems 2022 (pp. 577-86). Springer, Singapore.
[Crossref] [Google Scholar]
[9]Fu H, Wu L, Jian M, Yang Y, Wang X. MF-SORT: simple online and realtime tracking with motion features. In international conference on image and graphics 2019 (pp. 157-68). Springer, Cham.
[Crossref] [Google Scholar]
[10]Zhu J, Zhang S, Yang J. Online multi-object tracking using single object tracker and Markov clustering. In international conference on image and graphics 2019 (pp. 555-67). Springer, Cham.
[Crossref] [Google Scholar]
[11]Borji A, Cheng MM, Hou Q, Jiang H, Li J. Salient object detection: a survey. Computational Visual Media. 2019; 5(2):117-50.
[Crossref] [Google Scholar]
[12]Fan DP, Wang W, Cheng MM, Shen J. Shifting more attention to video salient object detection. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition 2019 (pp. 8554-64).
[Google Scholar]
[13]Santamaria AF, Raimondo P, Tropea M, De RF, Aiello C. An IoT surveillance system based on a decentralised architecture. Sensors. 2019; 19(6):1-23.
[Crossref] [Google Scholar]
[14]Li R, Pereira FC, Ben-akiva ME. Overview of traffic incident duration analysis and prediction. European Transport Research Review. 2018; 10(2):1-13.
[Crossref] [Google Scholar]
[15]https://www.v7labs.com/blog/object-detection-guide. Accessed 27 August 2022.
[16]Benterki A, Boukhnifer M, Judalet V, Maaoui C. Artificial intelligence for vehicle behavior anticipation: hybrid approach based on maneuver classification and trajectory prediction. IEEE Access. 2020; 8:56992-7002.
[Crossref] [Google Scholar]
[17]Al-omari A, Shatnawi N, Khedaywi T, Miqdady T. Prediction of traffic accidents hot spots using fuzzy logic and GIS. Applied Geomatics. 2020; 12(2):149-61.
[Crossref] [Google Scholar]
[18]Shen J, Tao D, Li X. Modality mixture projections for semantic video event detection. IEEE Transactions on Circuits and Systems for Video Technology. 2008; 18(11):1587-96.
[Crossref] [Google Scholar]
[19]Yi D, Su J, Liu C, Chen WH. Trajectory clustering aided personalized driver intention prediction for intelligent vehicles. IEEE Transactions on Industrial Informatics. 2018; 15(6):3693-702.
[Crossref] [Google Scholar]
[20]Martinesco A, Netto M, Neto AM, Etgens VH. A note on accidents involving autonomous vehicles: interdependence of event data recorder, human-vehicle cooperation and legal aspects. IFAC-PapersOnLine. 2019; 51(34):407-10.
[Crossref] [Google Scholar]
[21]Ji Z, Zhang Y, Pang Y, Li X, Pan J. Multi-video summarization with query-dependent weighted archetypal analysis. Neurocomputing. 2019; 332:406-16.
[Crossref] [Google Scholar]
[22]Wan S, Xu X, Wang T, Gu Z. An intelligent video analysis method for abnormal event detection in intelligent transportation systems. IEEE Transactions on Intelligent Transportation Systems. 2020; 22(7):4487-95.
[Crossref] [Google Scholar]
[23]Gelmini S, Strada S, Tanelli M, Savaresi S, De TC. Automatic crash detection system for two-wheeled vehicles: design and experimental validation. IFAC-PapersOnLine. 2019; 52(5):498-503.
[Crossref] [Google Scholar]
[24]Jahangiri A, Berardi VJ, Machiani SG. Application of real field connected vehicle data for aggressive driving identification on horizontal curves. IEEE Transactions on Intelligent Transportation Systems. 2017; 19(7):2316-24.
[Crossref] [Google Scholar]
[25]Balamurugan G, Jayabharathy J. A study on moving object recognition for video surveillance applications. Journal of Advanced Research in Dynamical and Control Systems. 2019:157-67.
[26]Lei J, Luan Q, Song X, Liu X, Tao D, Song M. Action parsing-driven video summarization based on reinforcement learning. IEEE Transactions on Circuits and Systems for Video Technology. 2018; 29(7):2126-37.
[Crossref] [Google Scholar]
[27]Yuan Y, Mei T, Cui P, Zhu W. Video summarization by learning deep side semantic embedding. IEEE Transactions on Circuits and Systems for Video Technology. 2017; 29(1):226-37.
[Crossref] [Google Scholar]
[28]Kalaivani P, Roomi SM, Jaishree B. Video event representation for abnormal event detection. In international conference on circuits and systems 2017 (pp. 463-8). IEEE.
[Crossref] [Google Scholar]
[29]Ahmed SA, Dogra DP, Kar S, Roy PP. Trajectory-based surveillance analysis: A survey. IEEE Transactions on Circuits and Systems for Video Technology. 2018; 29(7):1985-97.
[Crossref] [Google Scholar]
[30]Moses TM, Balachandran K. A classified study on semantic analysis of video summarization. In international conference on algorithms, methodology, models and applications in emerging technologies 2017 (pp. 1-6). IEEE.
[Crossref] [Google Scholar]
[31]Wang T, Miao Z, Chen Y, Zhou Y, Shan G, Snoussi H. Aed-net: an abnormal event detection network. Engineering. 2019; 5(5):930-9.
[Crossref] [Google Scholar]
[32]Chu W, Xue H, Yao C, Cai D. Sparse coding guided spatiotemporal feature learning for abnormal event detection in large videos. IEEE Transactions on Multimedia. 2018; 21(1):246-55.
[Crossref] [Google Scholar]
[33]Kalaivani P, Roomi SM. Towards comprehensive understanding of event detection and video summarization approaches. In second international conference on recent trends and challenges in computational models 2017 (pp. 61-6). IEEE.
[Crossref] [Google Scholar]
[34]Meghdadi AH, Irani P. Interactive exploration of surveillance video through action shot summarization and trajectory visualization. IEEE Transactions on Visualization and Computer Graphics. 2013; 19(12):2119-28.
[Crossref] [Google Scholar]
[35]Tejero-de-Pablos A, Nakashima Y, Sato T, Yokoya N, Linna M, Rahtu E. Summarization of user-generated sports video by using deep action recognition features. IEEE Transactions on Multimedia. 2018; 20(8):2000-11.
[Crossref] [Google Scholar]
[36]Coşar S, Donatiello G, Bogorny V, Garate C, Alvares LO, Brémond F. Toward abnormal trajectory and event detection in video surveillance. IEEE Transactions on Circuits and Systems for Video Technology. 2016; 27(3):683-95.
[Crossref] [Google Scholar]
[37]Muhammad K, Hussain T, Del Ser J, Palade V, De Albuquerque VH. DeepReS: a deep learning-based video summarization strategy for resource-constrained industrial surveillance scenarios. IEEE Transactions on Industrial Informatics. 2019; 16(9):5938-47.
[Crossref] [Google Scholar]
[38]Kong L, Dai R. Object-detection-based video compression for wireless surveillance systems. IEEE MultiMedia. 2017; 24(2):76-85.
[Crossref] [Google Scholar]
[39]Ye O, Deng J, Yu Z, Liu T, Dong L. Abnormal event detection via feature expectation subgraph calibrating classification in video surveillance scenes. IEEE Access. 2020; 8:97564-75.
[Crossref] [Google Scholar]
[40]Kumar K, Shrimankar DD. F-DES: fast and deep event summarization. IEEE Transactions on Multimedia. 2017; 20(2):323-34.
[Crossref] [Google Scholar]
[41]Lee S, Kim HG, Ro YM. BMAN: bidirectional multi-scale aggregation networks for abnormal event detection. IEEE Transactions on Image Processing. 2019; 29:2395-408.
[Crossref] [Google Scholar]
[42]Thomas SS, Gupta S, Subramanian VK. Perceptual video summarization—a new framework for video summarization. IEEE Transactions on Circuits and Systems for Video Technology. 2016; 27(8):1790-802.
[Crossref] [Google Scholar]
[43]Ji Z, Xiong K, Pang Y, Li X. Video summarization with attention-based encoder–decoder networks. IEEE Transactions on Circuits and Systems for Video Technology. 2019; 30(6):1709-17.
[Crossref] [Google Scholar]
[44]Pinho RR, Tavares JM, Correia MV. An improved management model for tracking missing features in computer vision long image sequences. Article in International Scientific Journal. 2006.
[Google Scholar]
[45]Song H, Sun C, Wu X, Chen M, Jia Y. Learning normal patterns via adversarial attention-based autoencoder for abnormal event detection in videos. IEEE Transactions on Multimedia. 2019; 22(8):2138-48.
[Crossref] [Google Scholar]
[46]Yan S, Smith JS, Lu W, Zhang B. Abnormal event detection from videos using a two-stream recurrent variational autoencoder. IEEE Transactions on Cognitive and Developmental Systems. 2018; 12(1):30-42.
[Crossref] [Google Scholar]
[47]Huang C, Wang H. A novel key-frames selection framework for comprehensive video summarization. IEEE Transactions on Circuits and Systems for Video Technology. 2019; 30(2):577-89.
[Crossref] [Google Scholar]
[48]Dilawari A, Khan MU. ASoVS: abstractive summarization of video sequences. IEEE Access. 2019; 7:29253-63.
[Crossref] [Google Scholar]