Library
|
Your profile |
National Security
Reference:
Novikov A.V.
Forecasting the risk of terrorist attacks based on machine learning algorithms
// National Security.
2022. ¹ 1.
P. 28-44.
DOI: 10.7256/2454-0668.2022.1.36596 URL: https://en.nbpublish.com/library_read_article.php?id=36596
Forecasting the risk of terrorist attacks based on machine learning algorithms
DOI: 10.7256/2454-0668.2022.1.36596Received: 06-10-2021Published: 15-03-2022Abstract: This article is devoted to the analysis and prediction of the risk of terrorist acts based on a comparison of various machine learning algorithms. In order to determine the most important indicators, more than thirty external and internal risk factors are comprehensively considered by quantifying them and an initial set of initial data is built. The study analyzes multidimensional socio-economic and political data for 136 countries for the period from 1992 to 2020. Four indicators are also predicted, reflecting the expected success of terrorist attacks, the likelihood of socio-economic consequences and general damage from terrorism. In addition to the classical analysis models, the effectiveness of the other four machine learning algorithms that can be used to analyze multidimensional data is compared. To predict the risk of terrorist attacks, a random forest model is created, and the effectiveness and accuracy of the model are evaluated based on statistical criteria. To determine the most important initial indicators, the method of recursive elimination of features in a random forest was used. The main result of this study is to identify the most important indicators for predicting the risk of terrorism and to reduce redundant indicators, which makes it possible to improve understanding of the main characteristics of attacks. Meanwhile, the results show that it is necessary to take appropriate proactive measures not only in the form of forceful detention, intelligence and response operations, but also to improve the stability of the state, achieve social balance and improve the quality of life of citizens. Keywords: terrorism, terrorist risk, risk factors, machine learning, random forest, model, countering terrorism, forecasting, social consequences, material and economic consequencesThis article is automatically translated. Introduction Terrorist attacks, as one of the most urgent types of conflicts, usually occur unexpectedly, lead to human casualties and eventually sow chaos [18]. According to the Global Terrorism Database (GTD), the number of terrorist attacks in the world has grown from an average of 1,000 cases per year at the beginning of the XXI century to more than 7,000 cases in 2020 [25]. Although the Governments of many countries have spent huge sums on the fight against terrorism during this period, the results have not been as impressive as they could have been. Counter-terrorism operations, such as pre-empting and suppressing terrorist attacks, are widely used, but sometimes they can further fuel attacks rather than prevent them [12]. Forecasting the risk of terrorist acts is useful for developing preventive measures, as well as for ensuring more targeted long-term policy formation. However, the factors that affect the risk of terrorist attacks are numerous and complexly interrelated. This ambiguity complicates forecasting and can lead to rash political decisions that endlessly delay resources and create panic among the population. Therefore, it is very important to identify the most important indicators affecting the risk of terrorist acts, on the basis of which forecasting will become clearer and more reliable, and decisions on countering terrorism will become more active [9]. In recent years, thanks to the development of machine learning technologies and their ability to effectively identify multifactorial relationships, relevant scientific papers are gradually overcoming the seemingly "unpredictable" problem of predicting terrorist attacks [5]. These works can be divided into two categories. In the first category, indicators of the level of individual terrorist incidents are mainly used to predict the future nature of terrorism. In the work of N. Mo, 56 initial attributes of a terrorist attack were selected from the GTD database, including time, location, type of attack, etc., and the types of terrorism were classified using the support vector machine (SVM), naive Bayesian classifier (NB) and logit regression (LR) [31]. K. Minu's research uses wavelet neural networks and GARCH models to predict the future time of terrorist attacks based on historical data on the number of attacks per month [30]. H. M. Ismail and H. Kazi applied 16 attributes of the level of a specific incident (such as year, type of attack, number of perpetrators, etc.) to assess the nature of terrorist attacks based on an ensemble classifier combining Bayesian models and decision tree (DT) models [26]. In the work of R. T. Brandt, the Markov switching Bayesian model was used to predict the intensity of future conflicts based on previous terrorist incidents [13]. Also, a number of Russian authors used Bayesian models and random forests to model the probability of terrorism [3, 5]. However, in the works of the first category, only micro-level indicators related to specific attacks are considered, and the influence of the "root factors" of terrorism, which are otherwise associated with macro-conditions or the external environment, is ignored [6]. The attacks take place in the broader context of terrorism and these complex conditions encompass political, economic, religious and other factors. An extensive group of works focused on the root causes of terrorism, starting with M. Crenshaw as the most representative early work [19]. In this literature, the methodologies used include qualitative analysis based on theoretical propositions and quantitative empirical studies using statistical tools. The main conclusions concerning the external factors of terrorism include: 1) poverty cannot directly lead to terrorism [10]; 2) democracy reduces the direct costs associated with terrorist attacks, but also increases the relative costs [22]; 3) urbanization is a breeding ground for terrorism [19]; 4) countries with growing populations seem to suffer less terrorism [21]; 5) terrorism is closely connected with other types of political violence, conflicts and wars [14]. In general, a consensus can be reached that the origin and spread of terrorism are caused by broad and far-reaching political, economic, ethnic and other problems [2]. However, these factors are insufficient to explain specific cases of attacks [4]. As part of the forecasting efforts in time series, it is necessary to take into account both the factors of incidents as internal causes of terrorist attacks and external conditions. Based on this, studies of the second category on the prediction of attacks take into account macro-level factors when making forecasts. S. Perry considers 30 economic indicators that measure factors such as unemployment, income and predicts mortality from terrorism using neural networks of back propagation (BP) [33]. N. V. Weidman and M. D. Ward also use four indicators of external conditions containing population size, ethnic composition and landscape to predict conflicts at the municipal level based on a spatial-temporal logistic model [37]. In the work of M. Hao uses a random estimate of the density of forests and cores to predict the potential risk of terrorist attacks on the Indochina peninsula [24]. Mainly in this work with the help of 15 indicators of external conditions containing three social indicators (fragility of the state, population density, the spread of drug trafficking) and eleven geographical indicators, such as average temperature and topography. In contrast to this approach, which considers only indicators of root causes, F. Ding considers both characteristics of incidents and macro-level indicators [20]. It uses a reverse propagation neural network (BPNN), SVM and random forest (RF) to predict the risk of terrorist attacks in various countries, mainly analyzing three incident indicators (such as latitude and longitude) and 10 macro-causes containing five geographical indicators (for example, average rainfall), as well as four demographic indicator. In the second category of studies on the prediction of terrorist attacks, the status of environmental factors was assessed. Nevertheless, existing research in the field of risk assessment is mainly aimed at improving the accuracy of forecasting based on pre-selected indicators representing various factors related to terrorist acts, and less often return to the validity of the choice of indicators [7]. This is partly due to the fact that the "black box" of machine learning models with numerous parameters and complex conclusions makes it difficult to clearly analyze the causal relationships between target dependent and independent variables [34]. Thus, the internal mechanism of the learning model is not clear enough and the relationship between the indicators cannot be understood intuitively. At the same time, after changing the input indicators, the initial conclusions cease to be reliable [15]. Methodology The general structure of the proposed approach consists of four stages. Forecasting the risk of terrorist attacks in different countries for a particular year is carried out using a sliding window with several inputs and outputs. The risk of terrorist attacks as a predicted target consists of four main sub - components: 1) Will there be more than one major terrorist attack next year? 2) What is the maximum level of property damage caused by terrorist acts next year? 3) What is the maximum level of human casualties as a result of terrorist attacks in the coming year? 4) What is the average success rate of terrorist attacks next year? Since it is impractical to consider each terrorist attack equally important and pay equal attention to them if no distinction is made between the severity of the attacks [18]. In general, this study is more concerned with the risk of attacks with severe consequences. As for the input data (independent variables) for forecasting, both external and internal factors are taken into account. External factors play the role of an invisible hand and represent various aspects of the state of social anomie, directly leading to the spread of terrorism and indirectly to terrorist attacks [11]. Internal factors are both triggers and consequences of terrorist attacks, fueling the terrorist phenomenon through a self-replicating feedback loop [27]. In total, 28 indicators were selected as predictors, of which 17 are macro–level indicators, and the remaining 11 are related to the internal characteristics of attacks. The initial data used in this study are collected from three well-known databases, such as the Global Terrorism Database (GTD), the International Country Risk Guide (ICRG) and the World Bank database [35, 36, 38]. GTD is one of the largest databases on terrorist attacks with open access, in which the target, type of attack, location and other indicators of terrorist attacks have been recorded since 1970. The ICRG registers the indicators of each country for each year in three subcategories of risks: political, financial and economic. The World Bank database contains reliable statistical socio-economic data at the country level. Selection of indicators and data preprocessing Terrorist attacks stem from the abstract context of terrorism, in which various factors contribute to the process of radicalization, and indicators are a quantitative interpretation of factors. As shown in Figure 1, the process of transition from radicalism to terrorist attacks consists of several stages [12]. First, terrorism is spreading because of deep-rooted structural causes. Secondly, some citizens become terrorists under the influence of external circumstances and subjective individual causal factors. Finally, due to specific triggers or reproducible feedback from previous acts, terrorists commit more and more attacks [8]. Fig. 1. The process of transition to terrorism under the influence of external factors. Source: compiled by the author.
Structural, accelerating and motivational factors are considered as three subcategories of the causes of terrorism based on T. Bjorgo's research [12]. Structural causes are factors affecting people's lives at the macro level that people may or may not be aware of. Accelerating factors make terrorism "attractive" and encourage people to become more prone to political violence or joining terrorist organizations. Motivational factors are people's personal experiences that encourage them to use terrorist tactics, including joining radical organizations or preparing for terrorist attacks. Factors in these three subcategories primarily contribute to the spread of terrorism and the recruitment of militants. Moreover, motivational reasons can also potentially lead to attacks, but only at the preparatory stage. Provoking and feedback factors are considered as two components of causes at the level of specific incidents. Provoking factors are direct precursors of specific attacks. For example, specific conflicts, controversial events, etc. Feedback factors basically represent the entire previous cycle of terrorist attacks. After the commission of a terrorist act, many audiences participate in the subsequent process, including allies, enemies, members of a terrorist organization, innocent civilians. The consequences, intentions and details of the attacks can be disclosed to the media, which can lead to an escalation effect, further increasing the risk of the next attack. This work focuses on identifying the most important indicators that affect the risk of terrorist attacks from the point of view of forecasting. The factors of the level of root causes and the level of incidents of terrorist acts are comprehensively considered. Internal factors play the role of an invisible hand and represent various aspects of the conditions of social disorder. The factors associated with terrorist attacks are both consequences and triggers of terrorist attacks. To quantify these abstract factors, a total of 28 indicators were selected as the initial set of input predictors of the proposed RF forecasting model. Based on this, the RF-RFE method is proposed to determine the most important indicators by recursively reducing the number of variables. The results of the study show that the minimum set of input indicators before the effectiveness of forecasting deteriorates significantly includes: the number of victims of attacks in the previous year, GDP growth, military spending, population growth rates, population size, unemployment, urban population growth, internal conflicts. The identified factors should be considered important and necessary for understanding the risk of terrorist attacks. The most important indicators identified indicate that the terrorist risk is due to both the root causes of terrorism and previous incidents of political violence. It is known that efforts to reduce the risk of terrorist attacks can be undertaken in two directions. Regarding the first aspect, it is necessary to constantly eliminate the root causes of terrorism by increasing the stability and legitimacy of the Government, reducing intra-State conflicts and contradictions, increasing freedom of entrepreneurship, increasing the dynamics of economic growth and reducing unemployment. As for the second aspect, measures are needed to prevent the growth of terrorism by minimizing the positive impact of previous terrorist attacks on possible future attacks. Key actions include reducing the likelihood of success of terrorist attacks, reducing the number of human casualties and the material and economic losses they cause. In addition, it is necessary to minimize the dissemination of negative information, such as statements about the intentions of terrorist organizations. The main limitation of this article is that due to the complexity of data collection and the relative sensitivity of the sample, only 28 indicators were considered. Future studies should analyze additional factors of terrorism development and focus on the primary impact of political risks, such as government stability. References
1. Gruzdev, A. V. Prognoznoe modelirovanie v IBM SPSS Statistics, R i Python: metod derev'ev reshenii i sluchainyi les. – M.: DMK Press, 2018. – 642 s.
2. Maksimov D.A. Ekonomicheskaya bezopasnost' predpriyatiya: otsenka i upravlenie // Slavyanskii forum. – 2016. – ¹ 3 (13). – S. 147-153. 3. Makhutov, N. A. Ispol'zovanie baiesovskikh setei dlya otsenki terroristicheskikh riskov i vybora optimal'noi strategii protivodeistviya terroristicheskoi ugroze / N. A. Makhutov, D. O. Reznikov // Problemy bezopasnosti i chrezvychainykh situatsii. – 2007. – ¹ 5. – S. 43-63. 4. Mashechkin I.V., Petrovskii M.I., Tsarev D.V., Chikunov M.N. Metody mashinnogo obucheniya dlya zadachi obnaruzheniya i monitoringa ekstremistskoi informatsii v seti internet // Programmirovanie. – 2019. ¹ 3. – S. 18-37. 5. Molotnikova, A. A., Zvonkova D. V. Ocherk problem terrorizma i prognozirovanie teraktov s ispol'zovaniem iskusstvennykh neironnykh setei / A. A. Molotnikova, D. V. Zvonkova // Nauka i obrazovanie: khozyaistvo i ekonomika; predprinimatel'stvo; pravo i upravlenie. – 2018. – ¹ 3(94). – S. 121-131. 6. Mokhaddam, F. M. Terrorizm s tochki zreniya terroristov: chto oni perezhivayut i dumayut i pochemu obrashchayutsya k nasiliyu / [per. V. A. Sosin]. – M.: Forum, 2011. – 286 s. 7. Tikhomirov N.P., Maksimov D.A., Shcherbakov A.V. Verifikatsiya prognozov na osnove analiza riskov ikh oshibok // Vestnik Rossiiskoi ekonomicheskoi akademii im. G.V. Plekhanova. – 2011. – ¹ 3(39). – S. 103-113. 8. Tikhomirova T.M., Sukiasyan A.G. Statisticheskii analiz riskov sotsial'noi napryazhennosti v regionakh RF // Innovatsii na osnove informatsionnykh i kommunikatsionnykh tekhnologii. – 2014. – ¹ 1. – S. 617-619. 9. Tramova A. M. Upravlenie riskami i bezopasnost'yu / A. M. Tramova, I. A. Kiseleva, N. E. Simonovich [i dr.]. – Nal'chik : Federal'noe gosudarstvennoe byudzhetnoe obrazovatel'noe uchrezhdenie vysshego obrazovaniya "Kabardino-Balkarskii gosudarstvennyi agrarnyi universitet imeni V.M. Kokova", 2017. – 182 s. 10. Abadie, A. Poverty, political freedom, and the roots of terrorism // American Economic Review. – 2006. – V. 96 (2). – P. 50-56. 11. Agnew, R. A general strain theory of terrorism // Theoretical Criminology. – 2010. – V. 14(2). – P. 131-153. 12. Bjorgo, T. Strategies for Preventing Terrorism / Palgrave Macmillan. – London, England. – 2013. 13. Brandt, P.T., Freeman, J.R., Schrodt, P.A., Real time, time series forecasting of inter-and intra-state political conflict // Conflict Management and Peace Science. – 2011. – V. 28(1). – P. 41-64. 14. Campos, N.F., Gassebner, M., International terrorism, domestic political instability, and the escalation effect // Economics and Politics. – 2013. – V. 25(1). – P. 27-47. 15. Casalicchio, G., Molnar, C., Bischl, B., 2018. Visualizing the feature importance for black box models. 2018. Rezhim dostupa: https://arxiv.org/pdf/1804.06620.pdf (data obrashcheniya 01.10.2021). 16. Chen, C., Reniers, G., Khakzad, N. Cost-benefit management of intentional domino effects in chemical industrial areas // Process Safety and Environmental Protection. – 2020. – V. 134. – P. 392-405. 17. Chen, X., He, Z., Chen, Y., Lu, Y., Wang, J., Missing traffic data imputation and pattern discovery with a bayesian augmented tensor factorization model // Transportation Research Part C Emerging Technologies. – 2019. – V. 104(7). – P. 66-77. 18. Clauset, A., Young, M., Gleditsch, K.S. On the frequency of severe terrorist events // Journal of Conflict Resolution. – 2007. – V. 51(1). – P. 58-87. 19. Crenshaw, M. The causes of terrorism // Comparative Politics. – 1981. – V. 13(4). – P. 379-399. 20. Ding, F., Ge, Q., Dong, J., Fu, J., Hao, M., Yang, J.M., Understanding the dynamics of terrorism events with multiple-discipline datasets and machine learning approach. PLoS ONE. – 2017. – V. 12(6). doi: 10.1371/journal.pone.0179057 21. Drenher, A., Fischer, J., Government decentralization as a disincentive for transnational terror? an empirical analysis // International Economic Review. – 2010. – V. 51(4). – P. 981-1002. 22. Frey, B.S., Luechinger, S., How to fight terrorism: Alternatives to deterrence // Defence and Peace Economics. – 2003. – V. 14(4). – P. 237-249. 23. Guyon, I., Weston, J., Barnhill, S., Vapnik, V., Gene selection for cancer classification using support vector machines // Machine Learning. – 2002. – V. 46(1). – P. 389-422. 24. Hao, M., Jiang, D., Ding, F., Fu, J., Chen, S. Simulating spatio-temporal patterns of terrorism incidents on the Indochina peninsula with GIS and the random forest method. International Journal of Geo-Information. – 2019. – V. 8(3). – P. 1-19. 25. Institute for Economics & Peace. Global Terrorism Index 2020: Measuring the Impact of Terrorism, Sydney, November 2020. Rezhim dostupa: https://www.visionofhumanity.org/wp-content/uploads/2020/11/GTI-2020-web-1.pdf (data obrashcheniya 01.10.2021). 26. Ismail, H.M., Kazi, H. Use of predictive modeling for prediction of future terrorist attacks in Pakistan // International Journal of Computer Applications. – 2018. – V. 179(15). – P. 8-16. 27. Lewis, E., Mohler, G., Brantingham, P.J., Bertozzi, A.L. Self-exciting point process models of civilian deaths in Iraq // Security Journal. – 2012. – V. 25(3). – P. 244-264. 28. Liu, J., Musialski, P., Wonka, P., Ye, J. Tensor completion for estimating missing values in visual data // IEEE International Conference on Computer Vision. – 2013. – V. 35(1). – P. 208-220. 29. Marchese Robinson, R.L., Palczewska, A., Palczewski, J., Kidley, N., 2017. Comparison of the predictive performance and interpretability of random forest and linear models on benchmark data sets // Journal of Chemical Information and Modeling. – 2017. – V. 57(8). – P. 1773-1792. 30. Minu, K., Lineesh, M., John, C.J. Wavelet neural networks for nonlinear time series analysis // Applied Mathematical Sciences. – 2010. – V. 4(50). – P. 2485-2495. 31. Mo, H., Meng, X., Li, J., Zhao, S. Terrorist event prediction based on revealing data / In: 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), Beijing, China, 2017.– P. 239-244. 32. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E. Scikit-learn: Machine learning in Python // Journal of Machine Learning Research. –2011. – V. 12. – P. 2825-2830. 33. Perry, C. Machine Learning and Conflict Prediction: A Use Case // Stability: International Journal of Security and Development. – 2013. – V. 2(3). – P. 56. 34. Semmelbeck, J., Besaw, C. Exploring the determinants of crime-terror cooperation using machine learning // Journal of Quantitative Criminology. – 2020. – V. 36(1). – P. 527-558. 35. The Global Terrorism Database (GTD). 2021. Rezhim dostupa: https://www.start.umd.edu/gtd/ (data obrashcheniya 01.10.2021). 36. The International Country Risk Guide (ICRG). 2021. Rezhim dostupa: https://www.prsgroup.com/explore-our-products/international-country-risk-guide/ (data obrashcheniya 01.10.2021). 37. Weidmann, N.B., Ward, M.D. Predicting conflict in space and time // Journal of Conflict Resolution. –2010. – V. 54(6). – P. 883-901. 38. World Bank Open Data. 2021. Rezhim dostupa: https://data.worldbank.org/ (data obrashcheniya 01.10.2021) |