[Apache Oozie] Quick Setup

Apache Oozie is a workflow scheduling system used to manage Hadoop jobs. In this post we shall look at a quick setup.

Pre-Conditions

JDK 1.7+

Building Oozie

Download Oozie source distribution.
Extract the distribution.

There are different options to customize based on Hadoop versions being used, but we would build a simple uber build

Execute the following command where the distribution was extracted

bash-3.2$ bin/mkdistro.sh -DskipTests -Puber

Once the build is done, the Oozie distribution can be found at /distro/tager folder, something like

bash-3.2$ cd distro/target/
bash-3.2$ ls
antrun				maven-shared-archive-resources	test-classes
archive-tmp			oozie-4.2.0-distro		tomcat
classes				oozie-4.2.0-distro.tar.gz
maven-archiver			oozie-distro-4.2.0.jar

Once done, copy the oozie-4.2.0-distro.tar.gz to a desired location and extract is. This becomes our Oozie installation home.

Setting up extjs

Download extjs2.2 to enable web console. You can find the same at following links

http://dev.sencha.com/deploy/ext-2.2.zip
http://archive.cloudera.com/gplextras/misc/ext-2.2.zip

Create a directory libext in Oozie installation home and copy extjs2.2.zip in the folder

Starting Oozie

From the Oozie installation home, execute the following

bash-3.2$ bin/oozied.sh start

This shall start the Oozie server in an embedded Tomcat. You can access Oozie console at

http://localhost:11000/oozie/

You should see the following screen.

Oozie Web Console

Oozie Web Console

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.