Automating cluster creation and management for Apache Spark in Openstack cloud
https://doi.org/10.15514/ISPRAS-2014-26(4)-3
About the Authors
O. Borisenko
Russian Federation
D. Turdakov
Russian Federation
S. Kuznetsov
Russian Federation
For citations:
Borisenko O., Turdakov D., Kuznetsov S. Automating cluster creation and management for Apache Spark in Openstack cloud. Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS). 2014;26(4):33-44. (In Russ.) https://doi.org/10.15514/ISPRAS-2014-26(4)-3