Research on the effectiveness of data compression methods in relational and NoSQL DBMS
https://doi.org/10.21822/2073-6185-2024-51-1-87-94
Abstract
Objective. The aim of this study is to evaluate the effectiveness of data compression methods in modern relational and NoSQL DBMSs.
Method. Alongside traditional relational solutions, NoSQL DBMSs are becoming increasingly popular, primarily because they can store and process very large volumes of data. Working effectively with such data requires compression, which reduces the amount of stored data while still providing quick access to it.
Result. A comparative analysis of data compression in the MySQL and OpenSearch DBMSs was performed.
Conclusion. Data compression in OpenSearch is more efficient than in MySQL when using standard algorithms. OpenSearch is an excellent choice if you need to conserve storage resources while making optimal use of the processor; otherwise, both systems will do the job well.
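The trade-off named in the conclusion, storage saved versus processor time spent, can be illustrated with a minimal sketch. It is not the authors' benchmark and does not touch MySQL or OpenSearch: it only applies zlib (DEFLATE) from the Python standard library to a hypothetical set of repetitive JSON-like records. The function name `measure_compression` and the sample data are assumptions made for illustration.

```python
import time
import zlib


def measure_compression(payload: bytes, level: int = 6) -> dict:
    """Compress payload with zlib (DEFLATE) and report sizes and elapsed time."""
    start = time.perf_counter()
    compressed = zlib.compress(payload, level)
    elapsed = time.perf_counter() - start
    # Round-trip check: decompression must restore the original bytes.
    assert zlib.decompress(compressed) == payload
    return {
        "level": level,
        "original_bytes": len(payload),
        "compressed_bytes": len(compressed),
        # Compression ratio = original size / compressed size.
        "ratio": len(payload) / len(compressed),
        "seconds": elapsed,
    }


# Hypothetical sample: repetitive records, loosely resembling stored documents.
sample = b"".join(
    b'{"id": %d, "status": "ok", "service": "billing"}\n' % i
    for i in range(10_000)
)

if __name__ == "__main__":
    # Higher levels usually trade more CPU time for a smaller result.
    for lvl in (1, 6, 9):
        r = measure_compression(sample, lvl)
        print(
            f"level={r['level']} "
            f"{r['original_bytes']} -> {r['compressed_bytes']} bytes "
            f"ratio={r['ratio']:.2f} time={r['seconds'] * 1000:.2f} ms"
        )
```

The printed ratio and timing depend on the data and the machine, so the numbers from this sketch say nothing about the relative behavior of the two DBMSs studied in the paper.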
About the Authors
V. A. Egunov
Russian Federation
Vitaly A. Egunov, Cand. Sci. (Eng.), Assoc. Prof., Computers and Systems Department
28 Lenin Ave., Volgograd 400005, Russia
V. S. Surin
Russian Federation
Valeriy S. Surin, Master Student, Computers and Systems Department
28 Lenin Ave., Volgograd 400005, Russia
P. S. Stupnitskiy
Russian Federation
Pavel S. Stupnitskiy, Master Student, Computers and Systems Department
28 Lenin Ave., Volgograd 400005, Russia
R. D. Akhmetova
Russian Federation
Rufina D. Akhmetova, Master Student, Computers and Systems Department
28 Lenin Ave., Volgograd 400005, Russia
For citations:
Egunov V.A., Surin V.S., Stupnitskiy P.S., Akhmetova R.D. Research on the effectiveness of data compression methods in relational and NoSQL DBMS. Herald of Dagestan State Technical University. Technical Sciences. 2024;51(1):87-94. (In Russ.) https://doi.org/10.21822/2073-6185-2024-51-1-87-94