Translate this page:
Please select your language to translate the article


You can just close the window to don't translate
Library
Your profile

Back to contents

Software systems and computational methods
Reference:

The impact of asynchronous and multithreaded query processing models on the performance of server-side web applications

Makarov Igor' Sergeevich

ORCID: 0009-0004-8734-2667

PhD in Technical Science

Head of the Department; Department of Applied Informatics (PI); Volga State University of Telecommunications and Informatics

23 Lva Tolstogo str., Samara region, 443010, Russia

igor-psati@yandex.ru
Larin Denis Vyacheslavovich

ORCID: 0009-0000-1904-7210

Student; Department of Computer Science and Computer Engineering; Volga Region State University of Telecommunications and Informatics

23 Lva Tolstogo str., Samara, Samara region, 443010, Russia

denlar1989@gmail.com
Vorobeva Evgeniia Grigor'evna

ORCID: 0009-0008-8225-7091

Student; Department of Computer Science and Computer Engineering; Volga Region State University of Telecommunications and Informatics

23 Lva Tolstogo str., Samara region, 443010, Russia

vorobeva.g2004@gmail.com
Emelin Daniil Pavlovich

Student; Department of Computer Science and Computer Engineering; Volga Region State University of Telecommunications and Informatics

23 Lva Tolstogo str., Samara, Samara region, 443010, Russia

demelin163@gmail.com
Kartashov Dmitry Aleksandrovich

Student; Department of Computer Science and Computer Engineering; Volga Region State University of Telecommunications and Informatics

23 Lva Tolstogo str., Samara region, 443010, Russia

tawerka40@gmail.com

DOI:

10.7256/2454-0714.2025.1.73665

EDN:

UZPPWT

Received:

12-03-2025


Published:

03-04-2025


Abstract: The object of the study is server-side web applications and their performance when processing a large number of simultaneous requests. Asynchronous technologies (Node.js, Python Asyncio, Go, Kotlin Coroutines) and multithreaded models (Java Threading, Python Threading). The authors analyze asynchronous event loops, goroutines, coroutines, and classical multithreaded approaches in detail, evaluating their effectiveness in tasks with intensive use of I/O and computing resources. An experiment is underway with API development in three languages (Java, Node.js, Go) and testing using the hey utility. It also explores the features of scalability, performance optimization, caching, error handling, load tests, and implementation features of parallel computing. The purpose of the study is to determine which approaches provide the highest performance in server applications.  Research methods include load testing, collection of metrics (response time, bandwidth, and server resource consumption), and analysis of the results. The scientific novelty lies in comparing asynchronous and multithreaded methods in real-world web development scenarios. The main conclusions of the study are recommendations on the use of asynchronous technologies in high-load I/O tasks and multithreading in computationally complex scenarios. The results obtained will help developers optimize the performance of server applications depending on their tasks and workload. Additionally, the study examines aspects of the complexity of debugging asynchronous applications, the impact of thread pools on the performance of multithreaded solutions, as well as scenarios in which asynchronous and multithreaded approaches can complement each other. Special attention is paid to server resource management under scalable loads, which will allow IT specialists to more accurately select tools and technologies for solving specific tasks. In conclusion, possible ways to optimize the operation of server applications are discussed, including the use of new approaches and algorithms, as well as the prospects for the development of asynchronous and multithreaded technologies in the context of highly loaded systems, their impact on the overall application architecture, as well as on increasing fault tolerance and security.


Keywords:

asynchrony, multithreading, efficiency, server-side web applications, load testing, Node.js, Python Asyncio, Go goroutines, Java Threading, performance of server applications

This article is automatically translated.

Introduction

Modern server-side web applications face increasing demands to handle a large number of simultaneous requests. As the number of users increases and operations become more complex, especially in applications running with APIs, databases, and external services, server-side performance becomes critical. In this context, the choice between asynchronous and multithreaded approaches becomes very important.

Asynchronous technologies such as Node.js (event loop), Python Asyncio (concurrency, managed events), Go (goroutines), Kotlin Coroutines, offer opportunities to improve performance due to non-blocking I/O operations and ease of managing a large number of simultaneous connections. On the other hand, multithreaded models such as Java Threading and Python Threading provide reliability and simplicity of logic, but can have overhead costs when creating a large number of threads. [1, c.93-95],[2]

This article examines the performance of server-side web applications using asynchronous and multithreaded approaches.

2. Overview of technologies and approaches

Asynchronous event loop.

Asynchronous event loop is represented by technologies such as Node.js and Python Asyncio. This approach involves using a single thread to manage a large number of requests via non-blocking I/O. This allows for high performance in I/O operations, but complicates debugging and requires the use of asynchronous libraries.[5]

Goroutines and coroutines.

Go and Kotlin Coroutines use lightweight language runtime-driven streams. They provide high parallelism and are particularly effective in resource-intensive tasks. However, the developer needs to explicitly specify competitive computing. [3, c.260-261],[4]

Multithreaded programming.

The multithreaded model (Java, Python Threading) allocates a separate thread for each request or uses a thread pool. This approach is useful for tasks that require parallel processing, but it requires careful design and synchronization to avoid errors and performance losses. [1, c.93-95]

The comparative table 2.1 of these technologies is shown below.:

Table 2.1 – comparison of technologies and approaches

Approach

Technologies

The principle of operation

Advantages

Disadvantages

Asynchronous event loop

Node.js, Python Asyncio

A single thread manages a large number of requests through non-blocking I/O.

High performance in I/O operations

The complexity of debugging, the need to adapt to asynchronous libraries

Goroutines and coroutines

Go, Kotlin Coroutines

Lightweight flows controlled by the runtime of the language

High parallelism, efficiency in resource-intensive tasks

Requires an understanding of competitive programming

Multithreaded programming

Java Threading, Python Threading

Each request receives a separate thread (or pool of threads)

Good language support, simple logic

High overhead for thread creation, resource locking

3. Research methodology

To conduct the research, a server application with an API in three languages was developed: Java (multithreaded), Node.js (asynchronous), Go (goroutines). The API endpoint processes an incoming request, reads data from the database (delay emulation), and sends a response. Load tests were performed with a different number of simultaneous requests (50, 100, 500, 1000, 5000). The average response time, bandwidth, and server resource usage (CPU, RAM) were evaluated.

4. Implementation

Load tests were performed for different load levels: 50, 100, 500, 1000, and 5000 simultaneous requests, which allowed us to identify how each approach copes with the increased load. For each test scenario, we measured the following parameters:

Average response time is the average time that has elapsed since the server received the request before sending the response to the client. An important indicator for evaluating the responsiveness of an application (Table 4.1).

Table 4.1 – Average response time

Language

Low load (50 requests)

High load (5000 requests)

Java

~304-306 ms (delay + overhead)

~1094-8966 ms (increase due to thread blocking)

Node.js

~236-248 ms (almost no overhead)

~401-785 ms (event loop handles well)

Go

~212-216 ms (slightly higher due to scheduler)

~384-549 ms (effective management of goroutines)

Node.js and Go have lower latency under high load due to asynchronous processing and ease of creating new tasks.

Throughput (RPS) is the number of requests that the server is able to process per unit of time (Table 4.2).

Table 4.2 – Throughput

Language

Low load (50 requests)

High load (5000 requests)

Java

~304-306 ms (delay + overhead)

~380-9087ms (increase due to thread blocking)

Node.js

~233-249 ms (almost no overhead)

~355-800 ms (event loop handles well)

Go

~211-217 ms (slightly higher due to scheduler)

~343-560 ms (effective management of goroutines)

Go leads the way because of the ease of mining and the fast scheduler, followed by Node.js Java is slowing down due to the cost of thread management and context switching.

We used the hey utility to perform load testing.

hey is a modern HTTP server load testing tool written in the Go language. It can be considered a modern analogue of utilities such as ApacheBench or wrk, but with a number of advantages:

- Ease and efficiency: hey allows you to quickly emulate thousands of simultaneous requests, which is ideal for testing the scalability of a system under real load conditions.

- Ease of use: to run the test, it is enough to specify the total number of requests and the number of simultaneous connections.

The integration of the hey utility into the load testing process allowed us to gain a deep understanding of the server's behavior under various load levels, which is a key point in optimizing web applications and ensuring their stable operation even under peak load conditions.[6]

CPU and RAM usage — how much computing resources the server consumes when processing requests. To investigate them, the Process Explorer utility was used (and also the built-in PowerShell monitoring for a specific Get-Process process | Where-Object { $_.ProcessName -like "java" } | Select-Object Name, CPU, WS). This tool shows the CPU usage, RAM, and other system information for the selected process (Table 4.3). [7]

Process Explorer is an advanced utility from Microsoft (Sysinternals) that allows you to monitor active processes in detail and analyze their use of system resources, including CPU (central processing unit) and RAM (RAM). [7]

Table 4.3 – CPU and RAM usage

RAM

CPU

Language

Low load

(50 requests)

High load (5000 requests)

Low load (50 requests)

High load (5000 requests)

Java

97.38 MB

131.32 MB

0,38%

6,82%

Node.js

65.79 MB

110.73 MB

0,38%

8,45%

Go

11.09 MB

82.77 MB

0,25%

3,80%

Go consumes the least amount of memory due to the economical goroutines. Node.js stays within the normal range, while Java requires more memory due to the heavy threading model and JVM operation.

Based on the tests performed, it can be concluded that Java is suitable for enterprise applications with complex logic, but loses in terms of resource consumption and latency under very high load. Node.js is great for I/O-dependent tasks and rapid development. Better performance under medium loads. Go is ideal for high-load systems and requires fewer resources. Wins with a large number of requests.

Conclusion

The study showed the importance of choosing the appropriate query processing model for server-side web applications, depending on the specifics of the load. Asynchronous technologies such as Node.js and Go have demonstrated high efficiency in solving tasks related to intensive use of input-output (I/O), ensuring stable performance even under high loads. These technologies have proven to be more preferable in scenarios with a large number of simultaneous requests, where it is important to minimize delays and resource intensity.

References
1. Opivalov, S. A. (2023). Methods of working with streams in Java. International Journal of Humanities and Natural Sciences, 4(79), 93-99.
2Node.js Guide, Part 1: General information and getting started. Retrieved March 3, 2025, from https://habr.com/ru/companies/ruvds/articles/422893/
3. Opivalov, S. A. (2023). Prospects for the use of Kotlin in programming. International Journal of Humanities and Natural Sciences, 9(84), 260-262.
4Parallel programming in Go. Retrieved March 3, 2025, from https://proglib.io/p/parallelnoe-programmirovanie-v-go-2021-05-23?ysclid=m7wag43tb6712881695
5Asynchronous functions and the Node.js event loop. Retrieved March 4, 2025, from https://translated.turbopages.org/proxy_u/en-ru.ru.137d7d27-67c89218-0c37f6b9-74722d776562/https/www.geeksforgeeks.org/asynchronous-functions-and-the-node-js-event-loop/
6Load testing using Hey. Retrieved March 4, 2025, from https://dev.to/saantoryuu/load-testing-using-hey-c84
7Process Explorer v17.06. Retrieved March 2, 2025, from https://learn.microsoft.com/en-us/sysinternals/downloads/process-explorer

Peer Review

Peer reviewers' evaluations remain confidential and are not disclosed to the public. Only external reviews, authorized for publication by the article's author(s), are made public. Typically, these final reviews are conducted after the manuscript's revision. Adhering to our double-blind review policy, the reviewer's identity is kept confidential.
The list of publisher reviewers can be found here.

The topic is especially relevant for cloud computing and fog computing globally distributed computing systems, where the workload of users in various regions of the world is characterized by large data flows of parallel processing of web requests (a worldwide big data problem), especially on the server and database side. The purpose of the work is not clearly formulated, although it is intuitively clear that this is to increase the performance of web applications on the server side in conditions of increased user load. The paper provides a comparative overview of asynchronous and multithreaded approaches based on the performance criterion of server applications. Node is considered as an example of asynchronous technologies.js, Python Asyncio, Go, Kotlin Coroutines, and multithreaded Java Threading and Python Threading. The advantages and disadvantages of each model are clearly outlined and practical recommendations for their use for developing server applications are given, depending on the specifics of the load, which is presented as the main novelty of the work. The research has been brought to software implementation in the form of load tests, which is undoubtedly the advantage of this work. Numerical results of the average response time and throughput are given. It is revealed that both approaches cope with an increase in user load. The minor difference is in programming methods, RAM usage, and integration with other software systems. The article is presented in a competent technical language that is understandable to the readership in this subject area. The structure of the material design, introduction, overview of technologies and approaches, research methodology, implementation, conclusion, bibliography, meet the requirements of the journal. Tables of acceptable quality according to the criterion of readability. No critical comments were found. The work can be published and corresponds to the subject of the journal "Software Systems and Computational Methods". To improve the quality of the work and the interest of the readership, the following recommendations can be taken into account: 1. The bibliography contains an insufficient number of 7 sources, of which 5 are links to online resources. It is recommended to add scientific publications in journals, preferably publicly available. 2. There is a grammatical error in bibliography No. 1. 3. The link in bibliography No. 5 does not open. 4. It is not customary to use the letter "e" in the text of the article, it may be incorrectly displayed on Internet sites and in citation systems, it should be replaced with the letter "e". 5. The phrases "we measured" and "we used" are abstract, it is better to specify who, and preferably a link to the source of information. 6. Table 4.3 - it is necessary to specify the unit of measurement, megabit (MB) or megabyte (MB). 7. The abbreviation I / O is deciphered in the conclusion, and not at the first mention, and in general it is not necessary to decipher I / O, CPU, RAM, because they are generally accepted in this subject area. General conclusion: the article can be accepted for publication in the journal "Software Systems and Computational Methods" with minor modifications.