Apache Hadoop on MTA Cloud

By using Apache Hadoop cluster, we are able to process huge amount of data, we can run typical Big Data applications using MapReduce framework. The tutorial, which is available on the MTA Cloud's official website (https://cloud.mta.hu/apache-hadoop-klaszter-kiepitese), sets up a complete Apache Hadoop infrastructure with the help of Occopus orchestration tool. The built-in Apache Hadoop architecture will be established using Occopus tool, so we need to install Occopus first. Descriptors for installing the Hadoop cluster have been created for users and published for them. After downloading and personalizing descriptors, with just two commands, MTA Cloud users will be able to build a scalable Apache Hadoop infrastructure on MTA Cloud.

Publications:

Lovas R, Nagy E, Kovacs J: Cloud agnostic orchestration for big data research platforms, CIVIL-COMP PROCEEDINGS 111: p. III/15. 16 p. (2017), The Fifth International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering (ISBN 978-1-905088-66-9)
Kiadó: http://www.ctresources.info/ccp/paper.html?id=9237
Eprint: http://eprints.sztaki.hu/9246/
Nagy E, Kovács J, Lovas R: Automated and Portable Hadoop Cluster Orchestration on Clouds with Occopus for Big Data Applications, In: Bubak M, Turala M, Wiatr K (szerk.)
Proceedings of Cracow Grid Workshop'16, CGW 2016. 92 p. Academic Computer Centre CYFRONETAGH, 2016. pp. 47-48.(ISBN:978-83-61433-20-0)
Eprint: http://eprints.sztaki.hu/9030/

Department

Laboratory of Parallel and Distributed Systems

http://www.sztaki.hu/en/science/departments/lpds

Lágymányosi u. 11, Budapest XI, Hungary

Room number

L 511

+36 1 279 6064

kacsuk.peter@sztaki.hu