Zenkov A.V., Zenkov M.A., Zenkov N.A. —
Pelevin vs Sorokin: an Attempt of Stylometric Comparison
// Philology: scientific researches. – 2024. – ¹ 7.
– P. 130 - 141.
DOI: 10.7256/2454-0749.2024.7.71169
URL: https://en.e-notabene.ru/fmag/article_71169.html
Read the article
Abstract: Our study is related to quantitative linguistics and focuses on the application of a new method for analyzing the author's style in literary texts. The method uses computer analysis of numerical data found in texts, including both cardinal and ordinal numerals, expressed both in numbers and verbally. Author used the program which automatically removed phraseological units and fixed combinations accidentally containing numerals. Before analysis, the text must be manually cleaned of numbers that do not contribute to the author's artistic vision, such as page numbers or chapter numbers. The analysis revealed that the use of numerals by an author in his/her texts is unique and individual, forming a characteristic feature that distinguishes texts by different authors. For the first time, a formal quantitative stylometric analysis is performed of the literary works by Victor Pelevin and Vladimir Sorokin – authors whose literary styles share many similarities when viewed through the lens of a traditional descriptive philological approach. To validate this methodology, we have also included the texts of four "impostor" authors in our analysis. It has been found that Pelevin's and Sorokin's texts differ significantly in their use of numerals. The data on occurrences of numerals in the texts were subjected to hierarchical clustering, which accurately divided the texts into groups based on their authorship. Since the clusterization results can be influenced by the choice of both metrics and clustering method, we tried various reasonable combinations of them to ensure the reliability of our results. Each time, the dendrogram would change only slightly. Thus, the clustering outcomes were found to be reliable. The proposed new method of quantitative linguistics, which is based on the analysis of numerals in literary texts, has the potential to successfully solve the stylometric problems, particularly related to the attribution of texts.
Zenkov A.V. —
Under a False Flag: Literary Hoaxes and the Use of Numerals
// Litera. – 2023. – ¹ 10.
– P. 86 - 109.
DOI: 10.25136/2409-8698.2023.10.68743
URL: https://en.e-notabene.ru/fil/article_68743.html
Read the article
Abstract: The present study pertains to stylometry. There are cases when a writer who has achieved fame, for various reasons, begins to create under a different name, tries to write in a different manner and, at times, again succeeds in a new incarnation. Whether the author is able to significantly change the literary style inherent in him or it is impossible to escape from himself – our work is devoted to the study of this issue. The study is based on the analysis of what numerals are present in the texts of an author. It has been shown by several examples from English-, French- and Russian-language literature, that the use of numerals is an author's feature that manifests itself in all or most of the sufficiently long texts of a given author. We apply our approach to the works of Romain Gary, Boris Akunin (Grigori Chkhartishvili) and some other authors of interest for stylometry. The conclusions are drawn on the basis of hierarchical cluster analysis and supported by the Pearson's chi-squared test.