Savanna is the OpenStack data processing project. Its Elastic Data Processing (EDP) feature lets you run MapReduce, Pig, and Hive jobs on Hadoop clusters provisioned by Savanna. This demo of the Savanna v0.3 release shows you how to launch and execute a Pig job.
The demo first gives you an overview of the objects in the Savanna dashboard, drilling into the details of plugins and cluster provisioning templates. Before running a job, you need input data for it to process. In this case, the GitHub repositories of OpenStack projects are cloned, placed in an archive, and uploaded to a Swift container. The demo then shows you how to create a data source for the input, specify where to store the output, and create the job binary that EDP executes. Finally, you create the job and launch it on the cluster.
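
The data preparation step (clone the repositories, then pack them into one archive) could be scripted roughly as follows. This is only a sketch and not the exact procedure used in the demo: the function names `clone_repos` and `build_archive`, the shallow clone, and the gzip-compressed tar format are all assumptions made for illustration, and the repository URLs are left for you to supply.

```python
import subprocess
import tarfile
from pathlib import Path


def clone_repos(repo_urls, workdir):
    """Shallow-clone each repository URL into workdir and return the clone paths.

    Hypothetical helper: requires the `git` command and network access.
    """
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)
    clones = []
    for url in repo_urls:
        # Derive the directory name from the last URL component.
        name = url.rstrip("/").split("/")[-1]
        if name.endswith(".git"):
            name = name[:-4]
        target = workdir / name
        subprocess.run(
            ["git", "clone", "--depth", "1", url, str(target)],
            check=True,
        )
        clones.append(target)
    return clones


def build_archive(directories, archive_path):
    """Pack the given directories into one gzip-compressed tar archive.

    The archive format is an assumption; the demo only says "an archive".
    """
    archive_path = Path(archive_path)
    with tarfile.open(archive_path, "w:gz") as tar:
        for directory in directories:
            directory = Path(directory)
            # Store each repository under its own top-level directory name.
            tar.add(directory, arcname=directory.name)
    return archive_path
```

The resulting file would then be uploaded to a Swift container, for example with the `swift upload` command from python-swiftclient, before you create the input data source in the Savanna dashboard.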