Volume 14 (2023) . Issue 4 (59) . Paper No. 2 (428)

Artificial intelligence and machine learning

Research Article

Method for classifying aspects of argumentation in Russian-language texts

Irina Nikolaevna Fishcheva1, Tatiana Anatolevna Peskisheva2, Valeriya Sergeevna Goloviznina3, Evgeny Vyacheslavovich Kotelnikov4Correspondent author

Vyatka State University, Kirov, Russia
4 Evgeny Vyacheslavovich Kotelnikov — Correspondent author kotelnikov.ev@gmail.com

Abstract. Argumentation mining in texts has attracted the attention of researchers in recent years due to a wide range of applications, in particular, in the analysis of scientific and legal texts, news articles, political debates, student essays and social media. Recently, a new task has been set in this area— aspect-based argumentation mining, where an aspect is defined as a property of the object, regarding which the argument is being built. Accounting for the aspects allows, on the one hand, to clarify the direction of the argumentation and understanding of the argument structure; on the other hand, it can be used to generate high-quality and aspect-specific arguments.

The article proposes a method for classifying aspects of argumentation in texts in Russian. On its basis we train and study the models for classifying aspects of argumentation using machine learning and neural networks. For the first time, a Russian-language text corpus was formed, including 1,426 sentences and marked by 16 aspects of argumentation, a neural network language model ArgBERT for classifying arguments was built, and Random Forest models were trained to classify aspects of argumentation. The classification performance obtained on the basis of Random Forest models is 0.6373 by F1-score. The developed models demonstrate the best performance for the aspects “Safety”, “Impact on health”, “Influence on the psyche”, “Attitude of the authorities” and “Standard of living” (F1‑score is higher than 0.75). (In Russian).

Keywords: argumentation mining, text corpora, neural network language models, machine learning, Random Forest, aspects of argumentation

MSC-20202020 Mathematics Subject Classification 68T07; 68T50MSC-2020 68-XX: Computer science
MSC-2020 68Txx: Artificial intelligence
MSC-2020 68T07: Artificial neural networks and deep learning

Acknowledgments: The study was supported by Russian Science Foundation grant No. 22-21-00885

For citation: Irina N. Fishcheva, Tatiana A. Peskisheva, Valeriya S. Goloviznina, Evgeny V. Kotelnikov. Method for classifying aspects of argumentation in Russian-language texts. Program Systems: Theory and Applications, 2023, 14:4, pp. 25–45. (In Russ.). https://psta.psiras.ru/2023/4_25-45.

Full text of article (PDF): https://psta.psiras.ru/read/psta2023_4_25-45.pdf.

The article was submitted 01.07.2023; approved after reviewing 19.07.2023; accepted for publication 20.07.2023; published online 19.10.2023.

© Fishcheva I. N., Peskisheva T. A., Goloviznina V. S., Kotelnikov E. V.
2023
Editorial address: Ailamazyan Program Systems Institute of the Russian Academy of Sciences, Peter the First Street 4«a», Veskovo village, Pereslavl area, Yaroslavl region, 152021 Russia; Phone: +7(4852) 695-228; E-mail: ; Website:  http://psta.psiras.ru
© Ailamazyan Program System Institute of Russian Academy of Science (site design) 2010–2024 The text of CC-BY-4.0 license