Makarova I.L., Ignatenko A.M., Kopyrin A.S. —
Detection and interpretation of erroneous data in statistical analysis of consumption of energy resources
// Software systems and computational methods. – 2021. – ¹ 3.
– P. 40 - 51.
DOI: 10.7256/2454-0714.2021.3.36564
URL: https://en.e-notabene.ru/itmag/article_36564.html
Read the article
Abstract: Monitoring and analysis of consumption of energy resources in various contexts, as well as measuring of parameters (indicators) in time are of utmost importance for the modern economy. This work is dedicated to examination and interpretation of the anomalies of collecting data on consumption of energy resources (on the example of gas consumption) in the municipal formation. Gas consumption is important for the socioeconomic sphere of cities. Unauthorized connections are the key reason for non-technological waste of the resource. The traditional methods of detection of stealing of gas are ineffective and time-consuming. The modern technologies of data analysis would allow detecting and interpreting the anomalies of consumption, as well as forming the lists for checking the objects for unauthorized connections. The author’s special contribution lies in application of the set of statistical methods aimed at processing and identification of anomalies in energy consumption of a municipal formation. It is worth noting that the use of such technologies requires the development of effective algorithms and implementation of automation and machine learning algorithms. The new perspective upon time-series data facilitates identification of anomalies, optimization of decision-making, etc. These processes can be automated. The presented methodology tested on time-series data that describes the consumption of gas can be used for a broader range of tasks. The research can be combined with the methods of knowledge discovery and deep learning algorithms.
Kopyrin A.S., Kopyrina A.O. —
Building a standard system of inference rules based on the knowledge base
// Software systems and computational methods. – 2021. – ¹ 1.
– P. 1 - 9.
DOI: 10.7256/2454-0714.2021.1.34798
URL: https://en.e-notabene.ru/itmag/article_34798.html
Read the article
Abstract: The authors propose to combine logical inference with the apparatus of fuzzy sets. When each solution is associated with a set of possible outcomes with known conditional probabilities, the solution is chosen based on digital information under conditions of uncertainty.
Therefore, the main purpose of using fuzzy logic in expert systems is to create computing devices (or software complexes) capable of simulating human thinking and explaining decision-making methods
The purpose of the work is to describe in detail a reproducible standard method of constructing rules for the output of an expert system for various economic subject areas, using a universal knowledge base scheme To make decisions in a fuzzy system, it is proposed to use the process of identifying the structure of a rule - determining the structural characteristics of a fuzzy system, such as the number of fuzzy rules, the number of linguistic terms into which incoming variables are divided. This identification is carried out using fuzzy cluster analysis, which is carried out using fuzzy decision trees. The authors present a block diagram of the inference methodology based on fuzzy logic. The method of constructing rules and the algorithm of fuzzy inference presented in the article can be used in various spheres of the economy. The novelty of the work lies in the automation and integration of the system for determining fuzzy inference rules with the stage of collecting input data in the subject area
Kopyrin A.S., Makarova I.L. —
Algorithm for preprocessing and unification of time series based on machine learning for data structuring
// Software systems and computational methods. – 2020. – ¹ 3.
– P. 40 - 50.
DOI: 10.7256/2454-0714.2020.3.33958
URL: https://en.e-notabene.ru/itmag/article_33958.html
Read the article
Abstract: The subject of the research is the process of collecting and preliminary preparation of data from heterogeneous sources. Economic information is heterogeneous and semi-structured or unstructured in nature. Due to the heterogeneity of the primary documents, as well as the human factor, the initial statistical data may contain a large amount of noise, as well as records, the automatic processing of which may be very difficult. This makes preprocessing dynamic input data an important precondition for discovering meaningful patterns and domain knowledge, and making the research topic relevant.Data preprocessing is a series of unique tasks that have led to the emergence of various algorithms and heuristic methods for solving preprocessing tasks such as merge and cleanup, identification of variablesIn this work, a preprocessing algorithm is formulated that allows you to bring together into a single database and structure information on time series from different sources. The key modification of the preprocessing method proposed by the authors is the technology of automated data integration.The technology proposed by the authors involves the combined use of methods for constructing a fuzzy time series and machine lexical comparison on the thesaurus network, as well as the use of a universal database built using the MIVAR concept.The preprocessing algorithm forms a single data model with the ability to transform the periodicity and semantics of the data set and integrate data that can come from various sources into a single information bank.
Kopyrin A.S. —
Modeling and forecasting of the dynamics of “quality of life” of the population of Krasnodar Krai
// Theoretical and Applied Economics. – 2020. – ¹ 2.
– P. 105 - 116.
DOI: 10.25136/2409-8647.2020.2.32192
URL: https://en.e-notabene.ru/etc/article_32192.html
Read the article
Abstract: The goal of this research consists in the analysis of trends in a compositely structured index of the “quality of life”, and analysis of sensitivity of a complex indicator by separate factors and cross-section of population of Krasnodar Krai. Based on the acquired results, the author builds a mathematical economic model for the analysis and forecasting of changes in the quality of life of the population of Krasnodar Krai in the context of diverse clusters of municipal formations, as well as compares different functions of approximation. The subject of this research is the socioeconomic interaction within the regional system of Krasnodar Krai. The article carries out a retrospective analysis is conducted on the most important indexes characterizing the subject field, correlation-regression analysis of variables, and sensitivity analysis based on the coefficients of elasticity of private factors. The author’s main contribution into the research of this topic lies in building the regression models of the quality of life index of the population of Krasnodar Krai in the context of diverse social clusters, as wll as in determination of the degree of impact of separate social, demographic or economic factors upon complex indicator. Such models allow forecasting and carrying out experimental modeling in this area.
Ignatenko A.M., Makarova I.L., Kopyrin A.S. —
Methods for preparing data for the analysis of poorly structured time series
// Software systems and computational methods. – 2019. – ¹ 4.
– P. 87 - 94.
DOI: 10.7256/2454-0714.2019.4.31797
URL: https://en.e-notabene.ru/itmag/article_31797.html
Read the article
Abstract: The aim of the study is to prepare for the analysis of poorly structured source data, their analysis, the study of the influence of data "pollution" on the results of regression analysis. The task of structuring data, preparing them for a qualitative analysis is a unique task for each specific set of source data and cannot be solved using a general algorithm, it will always have its own characteristics. The problems that may cause difficulties when working (analysis, processing, search) with poorly structured data are considered. Examples of poorly structured data and structured data that are used in the preparation of data for analysis are given. These algorithms for preparing weakly structured data for analysis are considered and described. The cleaning and analysis procedures on the data set were carried out. Four regression models were constructed and compared. As a result, the following conclusions were formulated: Exclusion from the analysis of various kinds of suspicious observations can drastically reduce the size of the population and lead to an unreasonable decrease in variation. At the same time, such an approach would be completely unacceptable if, as a result, important objects of observation are excluded from the analysis and the integrity of the population is violated. The quality of the constructed model may deteriorate in the presence of abnormal values, but may also improve due to them.