I am getting an error. I checked each line, and it seems to come from the last line of the script.
This is the error message:
# of failed Map Tasks exceeded allowed limit. FailedCount: 1. LastFailedTask: task_201310012238_0035_m_000000
This is the log:
Failed to read data from "hdfs://sandbox:8020/user/hue/Batting.csv"
2013-10-02 00:52:54,768 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 1066: Unable to open iterator for alias join_data
I have run the code several times and it gives the same problem every time.
I deleted the Batting.csv file and uploaded it again, but that did not help.
What should I do next?
This is the code I am running:
batting = LOAD 'Batting.csv' USING PigStorage(',');
runs = FOREACH batting GENERATE $0 as playerID, $1 as year, $8 as runs;
grp_data = GROUP runs BY (year);
max_runs = FOREACH grp_data GENERATE group as grp, MAX(runs.runs) as max_runs;
join_max_run = JOIN max_runs by ($0, max_runs), runs by (year,runs);
join_data = FOREACH join_max_run GENERATE $0 as year, $2 as playerID, $1 as runs;
When I run the syntax check, I get this message:
2013-10-02 01:04:46,222 [main] WARN org.apache.pig.PigServer - Encountered Warning IMPLICIT_CAST_TO_DOUBLE 1 time(s).
2013-10-02 01:04:46,266 [main] WARN org.apache.pig.PigServer - Encountered Warning IMPLICIT_CAST_TO_DOUBLE 1 time(s).
2013-10-02 01:04:46,282 [main] WARN org.apache.pig.tools.grunt.GruntParser - 'dump' statement is ignored while processing 'explain -script' or '-check'
script.pig syntax OK
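One variation that might be worth trying, sketched here under an assumption the log does not confirm: that the first line of Batting.csv is a header row with column names, so the non-numeric text in column $8 makes MAX fail inside the map task. The IMPLICIT_CAST_TO_DOUBLE warning suggests the fields are loaded untyped, so this version casts them explicitly and drops rows where the cast yields null. The alias raw_runs is made up for this sketch; the rest of the script would stay unchanged.

```pig
-- Assumption: Batting.csv begins with a header line (not confirmed by the log).
batting = LOAD 'Batting.csv' USING PigStorage(',');

-- Cast explicitly instead of relying on the implicit cast to double;
-- values that are not numeric (such as header text) become null.
raw_runs = FOREACH batting GENERATE (chararray)$0 AS playerID, (int)$1 AS year, (int)$8 AS runs;

-- Keep only rows where both casts succeeded.
runs = FILTER raw_runs BY year IS NOT NULL AND runs IS NOT NULL;
```

If the job still fails with this change, the header row is probably not the cause, and the full task log for task_201310012238_0035_m_000000 would be the next place to look.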