THE ALGORITHMS FOR SOFTWARE SYSTEM OF SCIENTIFIC PUBLICATIONS ANALYSIS

  • Yevgen Berlizov Odessa National Polytechnic University
  • Viktor Hapiak Odessa National Polytechnic University
  • Vira Liubchenko Odessa National Polytechnic University

Abstract

The purpose of the work is to define algorithms for the software system of scientific publications
analysis, designed to identify research areas and groups of researchers with similar interests within the same university
or faculty.
There are many algorithms for solving information extracting problems, but they have some disadvantages
regarding the solved problem. Therefore, we developed a proprietary algorithm that consists of four steps: lexical
analysis, terminals normalization, entities combining and filtering.
The results of information extracting are used to solve identification problems of authors groups and keywords
groups considered as a clustering problem. The analyzed data are presented in the form of graphs of two types: a
weighted graph of authors’ interactions and semantic graph of papers. This allows using for the analysis the clustering
algorithms based on graph theory and algorithm of stochastic analysis MCL. An analysis of a test articles sample
showed that clustering algorithms based on graph theory and algorithm of MCL identified the same clusters, but the
algorithm that based on minimum spanning tree was better regarding computational complexity.

Author Biographies

Yevgen Berlizov, Odessa National Polytechnic University

Student of
Odessa National
Polytechnic University

Viktor Hapiak, Odessa National Polytechnic University

Student of Odessa National
Polytechnic University

Vira Liubchenko, Odessa National Polytechnic University

D.Sc., Assoc. Prof.
Head of System Software
Department, Odessa
National Polytechnic
University

Published
2016-05-31
How to Cite
Berlizov, Y., Hapiak, V., & Liubchenko, V. (2016). THE ALGORITHMS FOR SOFTWARE SYSTEM OF SCIENTIFIC PUBLICATIONS ANALYSIS. Electrotechnic and Computer Systems, (23(99), 53-57. Retrieved from https://eltecs.op.edu.ua/index.php/journal/article/view/739
Section
Information systems and technologies