Herald of Dagestan State Technical University. Technical Sciences

APPLICATION OF DATA ANALYSIS METHODS FOR AUTOMATION OF ONTOLOGY FORMATION

https://doi.org/10.21822/2073-6185-2018-45-1-172-180

Abstract

Objectives. The aim of this work is to develop methods for automated text analysis and for retrieving relevant data from full-text documents, and to apply semantic text analysis methods that use linguistic ontologies as formalised models of subject area representation. A further aim is to use electronic encyclopedias, primarily Wikipedia, as the basis for constructing linguistic ontologies so as to derive maximum semantic information about their concepts, vocabulary expressions, interrelations and hierarchy.

Methods. The search for solutions relies on system analysis methods and on emerging technologies for processing both the text itself and the research problem that such processing is intended to solve. When creating contemporary artificial intelligence systems or their components, developers and researchers often need to formalise a subject area in order to automate the processing of phrases, collocations and sentences that enter the system in natural language form. Currently, the most popular approach to the formal description of a subject area is to construct an ontology.

Results. Established approaches to the retrieval of information are described along with the architecture of the automated system and the results of their application.

Conclusion. Semantic data analysis methods are applied with linguistic ontologies used as the formalised models of subject area representation. Approaches to retrieving information from Wikipedia are described along with the architecture of the automated system and results of its application.
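As an illustration only, the sketch below shows one way the four kinds of semantic information named in the abstract (concepts, vocabulary expressions, interrelations and hierarchy) might be held in a simple data structure. It is not the system described in the article. The record layout (title, redirects, categories, links), the names Concept, build_ontology and ancestors, and the toy records themselves are assumptions made for this example; no encyclopedia is actually queried.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    name: str
    labels: set = field(default_factory=set)   # vocabulary expressions (synonyms)
    parents: set = field(default_factory=set)  # hierarchy: broader concepts
    related: set = field(default_factory=set)  # non-hierarchical interrelations


def build_ontology(pages):
    """Build a name -> Concept mapping from encyclopedia-like records."""
    ontology = {}

    def concept(name):
        # Create the concept on first mention; its own name is its first label.
        if name not in ontology:
            ontology[name] = Concept(name, {name})
        return ontology[name]

    for page in pages:
        c = concept(page["title"])
        c.labels.update(page.get("redirects", []))
        for category in page.get("categories", []):
            concept(category)
            c.parents.add(category)
        for link in page.get("links", []):
            concept(link)
            c.related.add(link)
    return ontology


def ancestors(ontology, name):
    """Return every broader concept reachable through parent links."""
    seen = set()
    stack = list(ontology[name].parents)
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(ontology[current].parents)
    return seen


# Hand-written toy records (hypothetical, not extracted from Wikipedia).
PAGES = [
    {
        "title": "Ontology",
        "redirects": ["Computational ontology"],
        "categories": ["Knowledge representation"],
        "links": ["Semantic Web"],
    },
    {"title": "Knowledge representation", "categories": ["Artificial intelligence"]},
    {"title": "Semantic Web", "categories": ["Knowledge representation"]},
]

ontology = build_ontology(PAGES)
print(sorted(ancestors(ontology, "Semantic Web")))
```

Treating categories as parent concepts and redirects as synonyms is a simplification chosen for brevity; a real extraction pipeline would need to filter and disambiguate both.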

About the Author

O. N. Yurkova
Bryansk State Engineering and Technology University
Russian Federation

Olga N. Yurkova – Cand. Sci. (Economics), Assoc. Prof., Department of Information Technologies.

3 Stanke Dimitrova Ave., Bryansk 241037




For citations:


Yurkova O.N. APPLICATION OF DATA ANALYSIS METHODS FOR AUTOMATION OF ONTOLOGY FORMATION. Herald of Dagestan State Technical University. Technical Sciences. 2018;45(1):172-180. (In Russ.) https://doi.org/10.21822/2073-6185-2018-45-1-172-180


This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2073-6185 (Print)
ISSN 2542-095X (Online)