Tag Archive: coordinator

oozie workflow

How to schedule Cloudera Impala data pipelines in Apache Oozie?

Oozie is a workflow scheduler built on Hadoop that lets us create workflows and schedule them. We can build data pipelines whose components can be Java code, Sqoop, Pig, Hive, shell scripts and so on. Inside a workflow, jobs can be defined to run either in parallel or in sequence. HUE provides a graphical interface for Oozie, where we can conveniently define, manage and monitor our jobs. Components…