Oozie workflow could not be created with graphical interface.

to create new topics or reply. | New User Registration

This topic contains 1 reply, has 2 voices, and was last updated by  Yi Zhang 1 year, 3 months ago.

  • Creator
  • #44126

    Shahzad Aslam

    I could not create a oozie workflow usnig graphical interface of oozie. i have only one action MapReduce here is the error i am getting

    Caused by: java.lang.ClassNotFoundException: Class MapClass not found

    PLease note that the jar i am using is running fine when i initiate map reduce from command line that means jar i have for wordcount is wokring fine

    it seems like map class is not found

    here are parameters i set

    mapred.job.queue.name default
    mapred.mapper.class MapClass
    mapred.reducer.class Reduce
    mapred.input.dir /tmp/test
    mapred.output.dir /tmp/res
    mapred.map.tasks 1
    mapred.mapoutput.key.class org.apache.hadoop.io.Text
    mapred.mapoutput.value.class org.apache.hadoop.io.IntWritable

    in the end of this post are the classes i have. both classes in default package in eclipse folder structure. now what is the setting for correct path to the mapper class and reducer class.

    public class MapClass extends MapReduceBase
    implements Mapper
    private Text word = new Text();
    private final static IntWritable one = new IntWritable( 1 );

    public void map( LongWritable key, // Offset into the file
    Text value,
    OutputCollector output,
    Reporter reporter) throws IOException
    // Get the value as a String
    String text = value.toString().toLowerCase();

    public class Reduce extends MapReduceBase
    implements Reducer
    public void reduce( Text key, Iterator values,
    OutputCollector output,
    Reporter reporter) throws IOException

Viewing 1 replies (of 1 total)

You must be to reply to this topic. | Create Account

  • Author
  • #44806

    Yi Zhang

    Hi Shahzad,

    Can you post the workflow.xml file? Wonder if you have put your jar files in locations recognizable by oozie.


Viewing 1 replies (of 1 total)
Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.