Severina E.M., Fyodorov N.A. —
The Chekhov Digital project: semantic markup of a parallel corpus of translations of Chekhov's fiction into German
// Philology: scientific researches. – 2024. – № 4.
– P. 73 - 82.
DOI: 10.7256/2454-0749.2024.4.70560
URL: https://en.e-notabene.ru/fmag/article_70560.html
Abstract: The article discusses the principles for building a semantically marked-up parallel corpus of German translations of Chekhov's fiction within the Chekhov Digital project, a digital academic edition of the writer's collected works in TEI (Text Encoding Initiative) format. The parallel corpus is intended to provide a digital infrastructure for studying the writer's works, allowing researchers to analyze and compare the original texts with their translations. The authors identify difficulties in interpreting significant elements of the writer's works, in the specifics of their translation into German, and in the semantic markup of translated fiction; for example, it proved difficult to define the boundaries of, and the relationships between, elements of semantic markup. Ways to overcome these difficulties are proposed, drawing on digital methods, natural language processing technologies, and the TEI standard for digital publication. Markup based on the TEI standard makes the documents machine-readable, which allows the development of tools for complex semantic information retrieval. Including parallel corpora of translations of A. P. Chekhov's works into different languages in the Chekhov Digital project expands the research toolkit of translation studies: it becomes possible to compare translations with originals, to detect similarities and differences in vocabulary, grammar, style and cultural references, and to automate routine research processes, which makes searching and analyzing information in large volumes of text much more efficient. The results of the project will contribute to the development of the digital humanities environment and to the preservation and popularization of the literary heritage of A. P. Chekhov.
The creation of a semantically marked-up parallel corpus of translations will be important for literary scholars, linguists and translators, allowing them to study the specifics of translations of Chekhov's works and to develop new forms of text analysis and interpretation. The experience gained during the project will be valuable for future research and practical applications, demonstrating the effectiveness of digital technologies in humanities research and education.
Severina E.M., Larionova M.C. —
The Chekhov Digital project: tasks and problems of implementing semantic markup of texts (on the example of A. P. Chekhov's story "The Death of an Official")
// Litera. – 2023. – № 10.
– P. 211 - 222.
DOI: 10.25136/2409-8698.2023.10.68862
URL: https://en.e-notabene.ru/fil/article_68862.html
Abstract: The article considers a model for preparing machine-readable (semantic) markup of texts for the Chekhov Digital project, using as an example the philological interpretation of individual significant elements of A. P. Chekhov's story "The Death of an Official" and the explicit representation of this information according to the Text Encoding Initiative (TEI/XML) standards for digital publication. Drawing on the work of literary scholars, significant entities have been identified for marking up the corpus of the writer's texts, but the question of how to represent them in the text remains quite complex. Aspects such as "properties, states and events; character features" were examined philologically in an excerpt from the story, with a view to what TEI markup can offer for preserving philological knowledge in a machine-readable format. One of the objectives of the Chekhov Digital project is to go beyond a simple digitized text and provide the researcher with useful digital tools. The article presents the elements of machine-readable markup that make it possible to mark up significant entities in Chekhov's texts in order to organize semantic search across the corpus. It also considers the problems and research tasks that arise in implementing such interdisciplinary projects, which require the combined efforts of specialists from different fields of knowledge. The project implements the principle of open research data, whose most important task is to build scientific communities around data. Work on the project has led to scientific cooperation between researchers of the Higher School of Economics, the Southern Scientific Center of the Russian Academy of Sciences, and Southern Federal University.
Severina E.M., Larionova M.C. —
New philological practices: semantic edition of A. P. Chekhov’s texts
// Philology: scientific researches. – 2020. – № 10.
– P. 13 - 21.
DOI: 10.7256/2454-0749.2020.10.33970
URL: https://en.e-notabene.ru/fmag/article_33970.html
Abstract: The authors present the concept of a semantic digital edition of A. P. Chekhov's texts, giving special attention to the research project Chekhov Digital, carried out at Southern Federal University and the Southern Scientific Center of the Russian Academy of Sciences. The goal of the project is to create semantic markup of the writer's collected works based on the Text Encoding Initiative (TEI) standards for digital publication, which allow the text to be presented in a format that can be processed by a computer, as well as on the standards for the electronic format of Russian-language literary texts set out in the Semantic edition of the texts of L. N. Tolstoy. The crucial task of the project is to provide digital tools useful to researchers, including convenient semantic search, statistical processing and visualization tools. The main source is the Academic Compilation of Complete Works and Letters of A. P. Chekhov in 30 volumes (1974-1983). At present, a preliminary markup structure has been developed for some categories found in Chekhov's texts, notes, commentaries, and the description of the volumes of the Academic Compilation. This structure supports semantic search, makes the editorial notes computer-readable, and makes the material suitable for study by computer methods as well as by the traditional approach. It is emphasized that the markup will expand as new categories are added, and that the semantic edition itself will broaden its network by including reference texts and editorially vetted texts that lie beyond the scope of the Academic Compilation. The semantic edition Chekhov Digital lays the groundwork for a new type of commentary that incorporates meanings and facts and that complicates and enriches the interpretation of the text. Such an edition is valuable for researchers, teachers, students, and general readers interested in the topic.
Severina E.M. —
Cultural concept in cognitive dimension
// Man and Culture. – 2020. – № 6.
– P. 81 - 93.
DOI: 10.25136/2409-8744.2020.6.34161
URL: https://en.e-notabene.ru/ca/article_34161.html
Abstract: This article reviews the methodological principles of studying cultural concepts within the cognitive approach, and the possibilities for reconstructing certain fragments of the linguistic worldview from the material of digital text corpora. The study relies on the cognitive view of a concept as a unit of structured and unstructured knowledge that shapes the cognition of an individual and of a culture as a whole, and on the results of conceptual research into the texts of philosophers who view culture as the symbolic creativity of a person associated with freedom (the concepts of I. Kant, E. Cassirer, N. A. Berdyaev). On this basis, the authors reconstructed certain fragments of the linguistic worldview and of ordinary consciousness correlated with the concept of “culture” in digital text corpora of the Russian and Anglo-Saxon cultures. Examination of the contexts in which verbal representations of the concept of “culture” are used in digital corpora of Russian and of different varieties of English shows that the key ideological values of the Anglo-Saxon linguistic worldview are the following: culture is instrumental in nature; civilization is regarded as the path of development of humanity; freedom is viewed as an intrinsic right, original and inherent to the human being, that should be protected. In the Russian-language texts, culture implies a value-based attitude towards the world and is mostly associated with the national culture; civilization is viewed in the context of a value-based attitude towards the world, but as the path of development of humanity as a whole; freedom has a value-based, individual, personalistic connotation and is supposed to be full and absolute, which is often understood as liberty of action and choice.
It is emphasized that corpus methods make it possible to reconstruct how a worldview is formed, how value priorities are chosen, and how surrounding reality is perceived in a specific culture, working from the contexts in which the verbal manifestations of cultural concepts are actually used.