A novel tree pattern-based violence detection model using audio signals
Article
Yildiz, Arif Metehan, Barua, Prabal D., Dogan, Sengul, Baygin, Mehmet, Tuncer, Turker, Ooi, Chui Ping, Fujita, Hamido and Acharya, U. Rajendra. 2023. "A novel tree pattern-based violence detection model using audio signals." Expert Systems with Applications. 224. https://doi.org/10.1016/j.eswa.2023.120031
Article Title | A novel tree pattern-based violence detection model using audio signals |
---|---|
ERA Journal ID | 17852 |
Article Category | Article |
Authors | Yildiz, Arif Metehan, Barua, Prabal D., Dogan, Sengul, Baygin, Mehmet, Tuncer, Turker, Ooi, Chui Ping, Fujita, Hamido and Acharya, U. Rajendra |
Journal Title | Expert Systems with Applications |
Journal Citation | 224 |
Article Number | 120031 |
Number of Pages | 12 |
Year | 2023 |
Publisher | Elsevier |
Place of Publication | United Kingdom |
ISSN | 0957-4174 |
1873-6793 | |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.eswa.2023.120031 |
Web Address (URL) | https://www.sciencedirect.com/science/article/pii/S095741742300533X |
Abstract | Physical violence detection using multimedia data is crucial for public safety and security. This is an important research area in information security and digital forensics. Research in video-based violence detection (VVD) has grown steadily in recent years with rapid increase in video surveillance systems worldwide. Verbal aggression detection technologies, on the other hand, are still limited due to the popularity of computer vision models. Thus, researchers have preferred to use computer vision models to detect violence using videos. We have presented a new automatic audio violence detection (AVD) model to fill this gap. Our AVD model is handcrafted and its details are as follows. This work collected a new audio dataset on verbal aggression from YouTube. A novel handcrafted model was proposed using multilevel feature extraction, feature selection, classification, and majority voting phases. A new local feature extraction function based on the binary tree was used to generate features from audio signals. We call this function tree pattern-23 (TreePat23), where 23 represents the number of wavelet bands/audio signals inputs. Wavelet bands were generated using tunable Q wavelet transform (TQWT) before being applied to our TreePat23 for feature extraction. The iterative neighborhood component analysis (INCA) and Chi2 were used to select the features. The selected features were classified using k nearest neighbors (kNN) and support vector machine (SVM) followed by iterative majority voting (IMV) method. The best-predicted vector was obtained by using a greedy algorithm. Finally, a new validation technique called leave one record out (LORO) cross-validation (CV) was used to validate the results. Our proposed TreePat23 model has attained classification accuracy of 89.68% and 89.75% with kNN and SVM, respectively. Our developed system has generated 14 results for each classifier and automatically selected the best result. Hence this model is a self-organized audio classification model which yielded over 89% classification accuracy for both classifiers using LORO CV strategy. |
Keywords | Audio forensics; Audio violence detection ; Tree pattern ; Signal processing ; Feature extraction ; Iterative feature selection |
ANZSRC Field of Research 2020 | 400306. Computational physiology |
Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
Byline Affiliations | Firat University, Turkey |
School of Business | |
Cogninet Australia, Australia | |
University of Technology Sydney | |
Australian International Institute of Higher Education, Australia | |
University of New England | |
Taylor’s University, Malaysia | |
SRM Institute of Science and Technology, India | |
Kumamoto University, Japan | |
University of Sydney | |
Ardahan University, Turkiye | |
Singapore University of Social Sciences (SUSS), Singapore | |
HUTECH University of Technology, Vietnam | |
University of Granada, Spain | |
Iwate Prefectural University, Japan | |
School of Mathematics, Physics and Computing |
Permalink -
https://research.usq.edu.au/item/z1w4x/a-novel-tree-pattern-based-violence-detection-model-using-audio-signals
41
total views0
total downloads4
views this month0
downloads this month