Translate this page:
Please select your language to translate the article


You can just close the window to don't translate
Library
Your profile

Back to contents

Software systems and computational methods
Reference:

Naykhanov N.V., Dyshenov B.A. Computing semantic similarity of concepts using Wikipedia link

Abstract: The research question is the semantic relatedness of terms. The target of research is measure the semantic relatedness of terms. The authors consider such aspects as the rationale for the choice of the theme of background knowledge, the construction of a graph of links and measurement of relatedness between concepts. In earlier studies the authors of semantic proximity is calculated based on the statistical characteristics using different contextual analysis methods, such as latent semantic analysis. This work is the first experience with the reference methods for determining a semantic relatedness. Therefore, the focus placed on ease of calculation steps. Evaluation semantic similarity is based on the WLM method and proximity measure for separate types of references of M. I. Varlamov, A.V. Korshunov. In contrast to the well-known measures of semantic proximity, based on the use of Wikipedia proposed in the measure uses a simple links Wikipedia articles such as "See. Also" and "Links". This approach allows us to raise the performance of the algorithm and is designed for use in applications requiring high accuracy of the result is not, and better performance of the algorithm. These tasks include establishing a correspondence between the competencies and educational standard annotations disciplines of the curriculum or the task of analyzing the students' answers to the open questions in the form. The developed measure is cheap, reasonably accurate and accessible.


Keywords:

link, structure of article of Wikipedia, the database of Wikipedia, background knowledge, semantic similarity of concepts, concept, link graph, distance between concepts, count indexing, link-based Measure


This article can be downloaded freely in PDF format for reading. Download article


References
1. Witten I., Milne D. An effective, low-cost measure of semantic relatedness obtained from Wikipedia links // Proceeding of AAAI Workshop on Wikipedia and Artificial Intelligence: an Evolving Synergy, AAAI Press, Chicago, USA. 2008. R. 25-30
2. Turdakov D.Yu. Texterra: infrastruktura dlya analiza tekstov / D.Yu. Trudakov i dr. // Trudy Instituta sistemnogo programmirovaniya RAN. 2014. T. 26. Vyp. 1. S. 421-438.
3. Russkaya Vikipediya [Elektronnyy resurs]. – URL: https://ru.wikipedia.org/wiki/Russkaya_Vikipediya (data obrashcheniya: 20.06.2016).
4. Varlamov M.I. Raschet semanticheskoy blizosti kontseptov na osnove kratchayshikh putey v grafe ssylok Vikipedii [Elektronnyy resurs]: prezentatsiya / M.I. Varlamov, A.V. Korshunov // URL: www.machinelearning.ru/wiki/images/f/fd/Varlamov2014iip.pdf (data obrashcheniya: 20.06.2016).
5. Varlamov M.I. Raschet semanticheskoy blizosti kontseptov na osnove kratchayshikh putey v grafe ssylok Vikipedii / M.I. Varlamov, A.V. Korshunov // Mashinnoe obuchenie i analiz dannykh. 2014. T. 1. № 8. S. 1107-1125.
6. Angliyskaya Vikipediya [Elektronnyy resurs]. – URL: https://ru.wikipedia.org/wiki/Angliyskaya_Vikipediya (data obrashcheniya: 20.06.2016).
7. Anisimov A.V. Metod vychisleniya semanticheskoy blizosti-svyaznosti mezhdu slovami estestvennogo yazyka / A.V. Anisimov, A.A. Marchenko, V.K. Kisenko // Kibernetika i sistemnyy analiz. 2011. № 4. S.18-27.