Preview

Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS)

Advanced search

Distributed data storage systems: analysis, classification and choice

https://doi.org/10.15514/ISPRAS-2015-27(6)-15

Abstract

This paper proposes an analysis of various distributed data storage systems and possible solutions to basic problems of the subject area, in particular, the issue of system scaling, data consistency, availability and partition tolerance. Performed analysis has allowed to reveal consistent patterns with an attempt to classify systems based on various parameters, in particular, the presence or absence of specific functions and mechanisms. The choices of the distributed data storage systems are based on data analysis and classification.

About the Authors

Alexander Tormasov
Innopolis University
Russian Federation


Anatoly Lysov
Innopolis University
Russian Federation


Emil Mazur
Innopolis University
Russian Federation


References

1. IBM Platform Computing Edition, Software Defined Storage For Dummies,New Jersey, 2014, pp.4-5.

2. Software-Defined Storage (SDS) - perspektivy rosta [growth prospects] - ChannelForIT. Available at: http://channel4it.com/blogs/Programmno-opredelyaemye-hranilishcha-vsyo-bolee-vostrebovany-6787.html (accessed 23 July 2015). (In Russian)

3. List of file systems - Wikipedia, the free encyclopedia. Available at: https://en.wikipedia.org/wiki/List_of_file_systems (accessed 23 July 2015).

4. Google Trends. Available at: https://www.google.com/trends (accessed 23 July 2015).

5. M.Placek, R. Buyya, A taxonomy of distributed storage systems, p. 53.

6. Kak Facebook sekonomil 75% energii, kotoraya trebuetsya dlya khraneniya dannykh pol’zovatelei [Fasebook saved 75% of the energy required to store users' data] / King Servers company’s blog / Habrahabr. Available at: http://habrahabr.ru/company/kingservers/blog/257699/ (accessed 23 July 2015). (In Russian)

7. Data compression - Wikipedia, the free encyclopedia. Available at: https://en.wikipedia.org/wiki/Data_compression (accessed 23 July 2015).

8. Deduplikatsiya dannykh - podkhod NetApp [Deduplication - NetApp’s approach] / NetApp Company’s Blog / Habrahabr. Available at: http://habrahabr.ru/company/netapp/blog/110482/ (accessed 23 July 2015). (In Russian)

9. Vvedenie v deduplikatsiyu dannykh [Introducition to data duplication] / Veeam Software company’s blog. Available at: http://habrahabr.ru/company/veeam/blog/203614/ (accessed 23 July 2015). (In Russian)

10. Erasure code - Wikipedia, the free encyclopedia. Available at: https://en.wikipedia.org/wiki/Erasure_code (accessed 23 July 2015).

11. O chem stoit zadumat’sya, sokhranyaya svoi dannye v oblake. Chast’ 2 [What you should thinking about, when you saving data in cloud. Part 2] / Habrahabr. Available at: http://habrahabr.ru/post/141487/ (accessed 23 July 2015). (In Russian)

12. Vysokaya dostupnost’ web-saita: georeplikatsia failov saita s “lsyncd” [Web site's high availability: file geo-replication of site with “lsyncd”] / Habrahabr. Available at: http://habrahabr.ru/company/infobox/blog/252751/ (accessed 23 July 2015). (In Russian)

13. Object storage - Wikipedia, the free encyclopedia. Available at: https://en.wikipedia.org/wiki/Object_storage (accessed 23 July 2015).

14. Ob’’ektnaya sistema khraneniya dannykh - konkurent zheleznykh SKHD [Object system of data storage - the competitor of hardware SAN]. Available at: http://www.jetinfo.ru/stati/konkurenty-zheleznykh-skhd (accessed 23 July 2015). (In Russian)

15. Y. R'kaina, Reliable and persistent storage for CoDeS a distributed collaborative system // 2013, p. 40

16. An Overview of Tahoe-LAFS. Secure and fault tolerant distributed storage system. Available at: https://code.google.com/p/nilestore/wiki/TahoeLAFSBasics (accessed 23 July 2015).

17. CAP theorem - Wikipedia, the free encyclopedia. Available at: https://ru.wikipedia.org/wiki/Теорема_CAP (accessed 23 July 2015). (In Russian)

18. Soglassovannye v konechnom schete [Eventually Consistent]. Available at: http://habrahabr.ru/post/100891 (accessed 23 July 2015). (In Russian)

19. B. Calder, J. Wang, A. Ogus, N. Nilakantan, A. Skjolsvold, S. McKelvie, Y. Xu, S. Srivastav, J. Wu, H. Simitci, J. Haridas, C. Uddaraju, H. Khatri, A. Edwards, V. Bedekar, S. Mainali, R. Abbasi, A. Agarwal, M. Fahim ul Haq, M. Ikram ul Haq, D. Bhardwaj, S. Dayanand, A. Adusumilli, M. McNett, S. Sankaran, K. Manivannan, L. Rigas Windows Azure Storage: a highly available cloud storage service with strong consistency // SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles, 2011, 143-157.

20. M. Campagna, AWS Key Management Service Cryptographic Details, 2015, p. 28

21. Autonomic computing - Wikipedia, the free encyclopedia. Available at: https://en.wikipedia.org/wiki/Autonomic_computing (accessed 23 July 2015).

22. Self-management (computer science) - Wikipedia, the free encyclopedia. Available at: https://en.wikipedia.org/wiki/Self-management_(computer_science) (accessed 23 July 2015).

23. Google poteryal chast’ dannykh pol’zovatelei iz-za udara molnii - BBC Russkaya sluzhba [Google lost a part of users' data because of a lightning strike - BBC Russian Service]. Available at: http://www.bbc.com/russian/international/2015/08/150819_google_lightning_data (accessed 25 August 2015). (In Russian)


Review

For citations:


Tormasov A., Lysov A., Mazur E. Distributed data storage systems: analysis, classification and choice. Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS). 2015;27(6):225-252. (In Russ.) https://doi.org/10.15514/ISPRAS-2015-27(6)-15



Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2079-8156 (Print)
ISSN 2220-6426 (Online)