Translate this page:
Please select your language to translate the article


You can just close the window to don't translate
Library
Your profile

Back to contents

Legal Studies
Reference:

Big Data' and Data Mining' Role in Fight Against Drug Trafficking

Belyaev Ivan Yur'evich

ORCID: 0000-0002-0126-0934

Graduate student, Department of Theory and History of State and Law and International Law, Federal State Autonomous Educational Institution of Higher Education "Samara National Research University named after Academician S.P. Korolev"

443086, Russia, Samarskaya oblast', g. Samara, ul. Moskovskoye shosse, 34

ivanbeliaef@yandex.ru
Other publications by this author
 

 
Belyaev Yurii Vladislavovich

lawyer, Law office N 544

443112, Russia, Samarskaya oblast', g. Samara, ul. Sergeya Lazo, 25A

vana682@gmail.com

DOI:

10.25136/2409-7136.2022.7.37872

EDN:

SBAHNK

Received:

13-04-2022


Published:

08-07-2022


Abstract: The subject of the research is the ways of using modern Big Data and Data Mining methodologies to improve the process of lawmaking and law enforcement practice in the field of illicit trafficking of new psychoactive substances. Based on the experience of European countries and international experience in general, the author proves the need to create a methodology for early detection of the spread of new psychoactive substances that pose a significant threat to public health. This technique, according to the author, can be based on the use of methods of applied statistical analysis of the Google Trends database, which, in turn, can be used to combat drug crime in terms of detecting and predicting crimes. The practical significance of the conducted research lies in the development of a methodology for early detection of the spread of new psychoactive substances, which can be used to optimize the operational legislative response and improve law enforcement practice in the field of combating illicit drug trafficking. The scientific novelty of the conducted research consists in the creation of a predictive rule in the form of a linear regression equation that allows predicting the spread of such a new psychoactive substance as mephedrone in various regions of the Russian Federation, as well as planning operational investigative measures taking into account the identified risk zones. This study may be of interest to law enforcement officials, legal scholars, students and teachers of law schools.


Keywords:

drug addiction, national security, drug trafficking, designer drugs, linear regression equation, comparative studies, early warning system, analogues of narcotic drugs, international law and order, anti-drug legislation

This article is automatically translated.

The last decade has been marked by the spread of a new category of narcotic drugs, called new psychoactive substances (NPS) [1]. The chemical structure of new psychoactive substances is heterogeneous and includes not only patented components, analogues of traditional drugs or prescription medications, but also new substances [2]. Some of these components and their effects are well studied, while the toxicity of others is not known. This is associated with an increase in serious side effects of new psychoactive substances, expressed in mass infections, psychoses, acute poisoning and overdose [3].  As a result, many new psychoactive substances are capable of causing serious harm to the health of their consumers [4].

Since the legal status of new psychoactive substances varies greatly in different countries, only some of them are included in international anti-drug conventions, which facilitates the turnover of these substances at the international level [5].

In response to the rapid spread of new psychoactive substances, the European Monitoring Center for Drugs and Drug Addiction (ECMN) and the United Nations Office on Drugs and Crime (UNODC) have implemented an Early Warning System and an Early Warning Advisory Portal, respectively [6].

For example, as soon as information about a new psychoactive substance arrives at the European Monitoring Center for Drugs and Drug Addiction, the procedure for assessing the risk of harm caused by it begins.  However, despite the informative nature of these monitoring systems, the responsiveness of their response to the rapid emergence and spread of new psychoactive substances is limited [7]. Firstly, the universal method of identifying new psychoactive substances is lagging behind. Secondly, for the transition of the monitoring procedure from the stage of risk assessment to the stage of control of a new psychoactive substance, strict compliance with three criteria is necessary: 1) proof of its psychoactive effect; 2) the ability to frame addiction; 3) dangerous effects on health and social integration. In this regard, the European Monitoring Center for Drugs and Drug Addiction notes the need to supplement the existing early warning system with more sensitive and flexible technological means.

Based on the large number of new psychoactive substances (unlike traditional drugs), as well as the limited nature of information about their effects, the need to optimize the system for obtaining initial data on their distribution becomes particularly relevant. In this regard, many researchers emphasize the special role of the Internet as a source of early detection of new psychoactive substances [8]. Any activity in the online environment leaves digital traces that can be used as an indicator of the appearance of new psychoactive substances. In particular, the history of search queries or communication on forums can be used for these purposes [9].

One of the key studies in this area is a project called "The Psychonaut Web Mapping Project", aimed at developing and using a web map of the appearance and distribution of new psychoactive substances in order to identify new trends over a two-year period [10]. This project has confirmed that with regard to the availability and use of narcotic drugs, the study of online forums allows you to obtain unique information that cannot be obtained by other means (in particular, sociological surveys). Within the framework of this project, it was possible to identify more than 400 new psychoactive substances even before information about them appeared in the scientific literature and became available to the general public.      

The effectiveness of this approach was also confirmed in another study, where one of the well-known online drug forums was monitored [11]. The analysis of the discussions of narcotic drugs conducted there made it possible to identify the direction of evolution of classes of narcotic drugs and individual narcotic components within these classes. Moreover, some of the new psychoactive substances were detected in this way earlier than they were identified by the EU Early Warning System mentioned above.

Another irreplaceable source of information in the field of studying new psychoactive substances is Google Trends. As shown in a number of studies, consumers interested in purchasing narcotic substances previously collect information about them using the Google Search search engine. This tool makes it possible to identify the interest of consumers, which has a high degree of correlation with subsequent actions to purchase the selected narcotic drug. At the same time, the available database for search is extensive and is constantly available almost in real time.

So, in the context of the topic under consideration, Google Trends was used to identify deaths from new psychoactive substances and to study the relationship between search queries and publications in the media. At the same time, the most in-depth studies using Google Trends data were conducted by Al-Imam & Abdul Majeed [12]. In each of these studies, the compliance of Google Trends data with the patterns identified on the Dark Web was confirmed. Additionally, the effectiveness of Goggle Trends as a means of early detection of new psychoactive substances was demonstrated.

Google Trends data was used by Gamma et al. to compare Google Search queries related to methamphetamine and related crimes in three countries (Switzerland, Germany, Austria). A reliable correlation was found between search queries for methamphetamine and the corresponding criminal acts, which can be used as an indicator of drug-related crimes.  

In this regard, we conducted our own research aimed at identifying the possibility of using the Google Trends database to predict criminal acts in the sphere of trafficking of new psychoactive substances in order to further improve criminal legislation and law enforcement practice.

As a hypothesis of the study, we put forward the assumption that there is a correlation between search queries in the Google service for the keywords "mephedrone" and "pawnshop". In the case of finding a correlation between these variables, at the second stage of the study, we assumed a linear regression analysis to create an equation that allows us to predict the value of the dependent variable "pawnshop".

As the primary data for statistical analysis, data from the Google Trends service for the Russian Federation over the past five years were used for two search queries: "mephedrone" and "pawnshop". The choice of these variables was determined by the following reasons.

1. Mephedrone is one of the most popular synthetic drugs in the Russian Federation.

2. Pawning things, including those obtained by criminal means, is one of the most common ways to obtain funds for the purchase of mephedrone.

3. There is a high probability of using the Google Search search service by drug–addicted subjects to find ways to purchase mephedrone, on the one hand, and pawnshop addresses to receive money, on the other.

The consumers of mephedrone of the Russian Federation who use a pawnshop as a means of obtaining money for its purchase act as a general aggregate in our study. As a sample, we used Google Search service users in the Russian Federation who have been searching for mephedrone and pawnshops over the past five years.

The representativeness of this sample is determined by the randomness of the selection of its elements and a large number of them, which, according to the central limit theorem, allows us to consider this distribution obeying the law of normal distribution.

The sample for this study was formed in the Google Trends service when comparing two search queries: "Mephedrone" and "Pawnshop" over the past five years. The results of the query in the form of a file were imported into the Statistical Package for the Social Studies (SPSS) program for further statistical analysis.

A fragment of the file with primary data for statistical analysis is presented below.

Table 1 – A fragment of primary data for statistical analysis   

Date

Mephedrone

Pawnshop

05.03.2017

5

74

12.03.2017

5

67

19.03.2017

4

67

26.03.2017

6

70

02.04.2017

6

77

09.04.2017

5

66

16.04.2017

3

67

23.04.2017

5

74

  

The first column "Date" includes the date of the last day of the week during which the search queries were summed up. Since the sample was formed over a period of five years and the measurement interval is a week, there are 260 rows in this column (the number of weeks in five years).

The second and third columns contain the number of search queries in relative (normalized) percentile indicators. The maximum percentile rank is 100, the minimum is 0. The expression of the number of queries in Google Trends is not in absolute values, but normalized – allows you to use a single measurement scale to compare different queries.

It should be noted that the use of the Big Data methodology and the Google Trends service in our case makes it possible to conduct research that could not be carried out using the traditional method – a sociological survey. In our study, we analyze data collected throughout Russia over the past five years. To conduct such a large-scale sociological survey over such a long period of time would require significant material and time costs. At the same time, even if respondents agreed to answer anonymous questions about the presence of drug dependence on mephedrone and planned to purchase it by pawning things (acquired, including by criminal means), the sincerity of such answers would be doubtful.

The descriptive statistics on the data obtained are given below.

Table 2 – Descriptive statistics

 

Mephedrone

Pawnshop

N

Valid

260

260

Missed

0

0

Average

15,43

79,90

Median

16,00

81,00

Standard deviation

7,687

8,291

Range

42

37

Minimum

3

63

Maximum

45

100

 

As follows from Table 2, the average number of search queries for the keyword "Pawnshop" for a week is more than five times higher than the average number of search queries for the keyword "Mephedrone": 79.9 and 15.4, respectively. At the same time, the average value of search queries for the search query "pawnshop" is in the range of 15.4+-0.9, and for the search query "mephedrone" in the range of 79.9+-1.

 

Figure 1 – Frequency distribution of the variable "Pawnshop"

At the next stage of the study, in order to answer the question whether there is a statistically significant correlation between these variables, we conducted a correlation analysis in the SPSS program. The results obtained are shown below.

Table 3 – Results of correlation analysis

 

Mephedrone

Pawnshop

Mephedrone

Pearson Correlation

1

,574**

Value. (double-sided)

 

,000

N

260

260

Pawnshop

Pearson Correlation

,574**

1

Meaning. (double-sided)

,000

 

N

260

260

**. The correlation is significant at 0.01 (two-way).

 

The obtained correlation coefficient (r=0.57) between the studied variables can be considered as an indicator of a strong relationship between them. As Cohen Jacob notes in one of his books, which has become a classic in the field of behavioral statistics, in social research, a correlation coefficient of 0.1 or less reflects a weak relationship between variables, a value of 0.3 reflects an average degree of connection, and a value of 0.5 or higher indicates a strong connection [13].  

Since at the first stage of the study we established a high degree of connection between the variables, at the second stage we conducted a linear regression analysis. The difference between correlation and regression analysis is that the former measures the strength of the relationship between variables, while the latter measures the nature of this relationship [14].

As emphasized in the latest edition of the monograph by a leading author in the field of Data Mining, linear regression analysis is one of the most common statistical methods at present. Linear regression analysis is used to describe the relationships between variables in scientific research [15].

In the course of linear regression analysis, the value of a variable is predicted based on the value of another variable. The variable that needs to be predicted is called a dependent variable. A variable that is used to predict the value of another variable is called an independent variable.

The dependent variable in our study is the number of search queries for the keyword "Pawnshop". And the independent variable is the number of search queries for the keyword "Mephedrone".

In the course of this analysis, such coefficients of a linear equation with one or more independent variables are selected so that this equation best predicts the value of the dependent variable.

The result of linear regression can be represented as a straight line on a plane that minimizes the discrepancy between the predicted and actual values.

Below is a graph of the ratio of predicted and actual values of the dependent variable based on the constructed regression line.

References
1. Niftalieva, I. A. Problems of the implementation of the principle of justice in the norms establishing responsibility for the illegal circulation of new potentially dangerous psychoactive substances / I. A. Niftalieva // Modern problems of criminal law sciences: collection of articles of the All-Russian Scientific and Practical Conference, Volgograd, 06 -08 July 2021. - Volgograd: IP Chernyaeva Yu.I., 2021. - S. 214-218. – EDN OVDWCJ.
2. M.H. Baumann, R.A. Glennon, J.L. Wiley (Eds.), Neuropharmacology of New Psychoactive Substances (NPS): The Science Behind the Headlines, Springer International Publishing, United States, 2017.
3. E. Deligianni, J.M. Corkery, F. Schifano, L.A. Lione, An international survey on the awareness, use, preference, and health perception of novel psychoactive substances (NPS), Hum. Psychopharmacol.: Clin. Exp. ¹ 32, (2017) e2581.
4. Moiseenkova, M. A. Illicit trafficking of new potentially dangerous psychoactive substances / M. A. Moiseenkova // Preservation of the world in modern conditions: Collection of articles of the International scientific and practical conference dedicated to the 60th anniversary of the International Public Foundation "Russian Peace Fund", Smolensk, May 10–15, 2021 / Authors-compilers: E.A. Ulyanenkova, A.M. Ivanov. Under the general editorship of E.A. Ivanova. - Smolensk: Limited Liability Company "Svitok", 2021. - P. 212-219. – EDN DJZWPK.
5. Belyaev, I. Yu. The problem of control systems for illicit trafficking in narcotic drugs and psychotropic substances / I. Yu. Belyaev // Security Issues. - 2022. - No. 1. - P. 35-47. – DOI 10.25136/2409-7543.2022.1.37626.
6. European Monitoring Centre for Drugs and Drug Addiction, EMCDDA–Europol 2010 Annual Report on the Implementation of Council Decision 2005/387/JHA, Lisbon, Portugal, 2011.
7. G.R. Potter, C. Chatwin, Not particularly special: Critiquing "NPS" as a category of drugs, Drug.: Educ. Prev. Policy ¹ 25 (2018), p. 329–336.
8. Z. Davey, F. Schifano, O. Corazza, P. Deluca, E-psychonauts: Conducting research in online drug forum communities, J. Ment. Health ¹ 21 (2012), p. 386–394.
9. Sezonova, T. V. Features of conducting individual investigative actions in the investigation of the illegal sale of narcotic drugs through an online store / T. V. Sezonova // Actual problems of combating crime: issues of theory and practice: materials of the XXIV international scientific and practical conference, Krasnoyarsk, April 08–09, 2021. - Krasnoyarsk: Siberian Law Institute of the Ministry of Internal Affairs of the Russian Federation, 2021. - P. 253-255. – DOI 10.51980/2021_1_253. – EDN YXAAIQ.
10. P. Deluca, Z. Davey, O. Corazza, L. Di Furia, M. Farre, L.H. Flesland, M. Mannonen, A. Majava, T. Peltoniemi, M. Pasinetti, C. Pezzolesi, N. Scherbaum, H. Siemann, A. Skutle, M. Torrens, P. van der Kreeft, E. Iversen, F. Schifano, Identifying emerging trends in recreational drug use; outcomes from the psychonaut web mapping project, Prog. Neuro-Psychopharmacol. Biol. Psychiatry ¹ 39 (2012), p. 221–226.
11. D. Rhumorbarbe, M. Morelato, L. Staehli, C. Roux, D.-O. Jaquet-Chiffelle, Q. Rossy, P. Esseiva, Monitoring new psychoactive substances: exploring the contribution of an online discussion forum, Int. J. Drug Policy ¹ 73 (2019), p. 273–280.
12. A. Al-Imam, B.A. AbdulMajeed, Captagon, octodrine, and nbome: an integrative analysis of trends databases, the deep web, and the darknet, Glob. J. Health Sci. ¹ 9 (2017), p. 114–125.
13. Cohen, Jacob. (1988). Statistical Power Analysis for the Behavioral Sciences (2nd ed). Hillsdale, NJ: Erlbaum, p. 141.
14. Peter Bruce and Andrew Bruce. Practical Statistics for Data Scientists. O’Reilly Media, Sebastopol, 2017, p.232.
15. Data Mining: Concepts, Models, Methods, and Algorithms, Third Edition. Mehmed Kantardzic. 2020 by The Institute of Electrical and Electronics Engineers, Inc. Published 2020 by John Wiley & Sons, Inc., p. 175.
16. Cohen, Jacob. (1988). Statistical Power Analysis for the Behavioral Sciences (2nd ed). Hillsdale, NJ: Erlbaum. p.114

Peer Review

Peer reviewers' evaluations remain confidential and are not disclosed to the public. Only external reviews, authorized for publication by the article's author(s), are made public. Typically, these final reviews are conducted after the manuscript's revision. Adhering to our double-blind review policy, the reviewer's identity is kept confidential.
The list of publisher reviewers can be found here.

The subject of the study is public relations in the field of countering illicit trafficking of psychoactive substances. The author analyzes the application of Big Data and Data Mining methodologies to improve legal technology and law enforcement practice. The research methodology has a practical focus. At the same time, the author explores the theoretical foundations of the desired problem. Methods such as general legal and empirical ones are used. The author applies modeling and forecasting methods, which fills his article with content. The relevance is due to the emergence of new ways of influencing people with psychotropic substances. At the same time, theory does not keep pace with the practice of such innovations. The scientific novelty consists in the complexity of the author's research, as well as in its results, conclusions and suggestions of the author. The article is prepared in a scientific style, using illustrative material. The author has developed and presented graphs and diagrams reflecting the problems of public relations in the field of countering illicit trafficking in psychoactive substances. The article includes an introductory part summarizing that, based on the large number of new psychoactive substances (unlike traditional drugs), as well as the limited nature of information about their effects, the need to optimize the system for obtaining initial data on their distribution becomes particularly relevant. The main content contains theoretical analysis, but is especially interesting for empirical components. The author puts forward hypotheses and works with them. In particular, it was suggested that there is a correlation between search queries in the Google service for the keywords "mephedrone" and "pawnshop". If a correlation was found between these variables, at the second stage of the study, the author proposed to conduct a linear regression analysis to create an equation that allows predicting the value of the dependent variable "lombard". The author notes that the use of the Big Data methodology and the Google Trends service in our case makes it possible to conduct research that would not be possible using the traditional method – a sociological survey. At the same time, an analysis of data collected throughout Russia over the past five years has been carried out. To conduct such a large-scale sociological survey over such a long period of time would require significant material and time costs. The author emphasizes the uniqueness of the study, since even if the respondents agreed to answer anonymous questions about the presence of drug dependence on mephedrone and planning to purchase it by pawning things (acquired, including by criminal means), the sincerity of such answers would be questionable. Tables, figures and graphs developed by the author are of particular practical interest. The author's conclusions are of scientific interest, such as the fact that modern Big Data and Data Mining methodologies present additional opportunities in the field of optimizing criminal law legislation and law enforcement practice aimed at combating illicit trafficking of new narcotic drugs. Another author's conclusion, which is that using the Google Trends service with subsequent statistical analysis of the results obtained by means of well-known application software packages (for example, the "Statistical Package for Social Sciences" by IBM), allows you to predict criminal behavior associated with drug use. The linear regression equation created by the author is relevant, which can be used to predict and identify crimes aimed at obtaining money by depositing stolen property in a pawnshop for the purchase of narcotic drugs. The bibliography includes 16 latest sources that meet the requirements for writing scientific articles. The presented work is of scientific and practical interest to the readership. I recommend it for publication.