AnsweredAssumed Answered

MapReduce job (wordcount) failed to run in a newly installed cluster with v2.0

Question asked by linda on Sep 2, 2012
Latest reply on Sep 5, 2012 by linda
Hi MapR experts,

The sample wordcount job hung after triggered and never returned until killed from console. No clear error message standing out. More detail info in below.

 1. MapR version: 2.0
 2. Cluster: one node cluster in a vm image with CentOS OS
 3. Services: All services are running with no error before trigger the job. (i.e. jobtracker/tasktraker/nfs/etc)
 4. Job triggered as root:

     # hadoop jar /opt/mapr/hadoop/hadoop-0.20.2/hadoop-0.20.2-dev-examples.jar wordcount /data/in /data/out

 5. Data: one test file copied to /data/in (content is one liner word list)
 6. File: hadoop-mapr-tasktracker-n1.log reads:

        echo 10 > /proc/self/oom_score_adj;renice -n 10 -p $$ 1>/dev/null;
        2012-09-02 15:57:02,737 INFO org.apache.hadoop.mapred.TaskController: Writing commands to /tmp/mapr-hadoop/mapred/local/ttprivate/taskTracker/root/jobcache/job_201209021529_0003/attempt_201209021529_0003_m_000001_0/taskjvm.sh
        2012-09-02 15:57:03,747 INFO org.apache.hadoop.mapred.TaskTracker: Setting pid 26036 for jvm jvm_201209021529_0003_m_540175592
        2012-09-02 15:57:03,748 INFO org.apache.hadoop.mapred.TaskTracker: JVM with ID: jvm_201209021529_0003_m_540175592 given task: attempt_201209021529_0003_m_000001_0
        2012-09-02 15:57:04,140 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201209021529_0003_m_000001_0 0.0%
        2012-09-02 15:57:04,255 INFO org.apache.hadoop.mapred.TaskTracker: attempt_201209021529_0003_m_000001_0 0.0% cleanup
        2012-09-02 15:57:04,258 INFO org.apache.hadoop.mapred.TaskTracker: Task attempt_201209021529_0003_m_000001_0 is done.
        2012-09-02 15:57:04,258 INFO org.apache.hadoop.mapred.TaskTracker: reported output size for attempt_201209021529_0003_m_000001_0  was -1
        2012-09-02 15:57:04,258 INFO org.apache.hadoop.mapred.TaskTracker: addFreeSlot : current free slots : 1
        2012-09-02 15:57:04,499 INFO org.apache.hadoop.mapred.TaskTracker: Received 'KillJobAction' for job: job_201209021529_0003
        2012-09-02 15:57:04,500 INFO org.apache.hadoop.mapred.UserLogCleaner: Adding job_201209021529_0003 for user-log deletion with retainTimeStamp:1346713024499
        2012-09-02 15:57:04,573 INFO org.apache.hadoop.mapred.TaskTracker: Killing JVM jvm_201209021529_0003_m_540175592 since job job_201209021529_0003 is dead
        2012-09-02 15:57:04,691 WARN org.apache.hadoop.mapred.LinuxTaskController: Exit code from task is : 137
        2012-09-02 15:57:04,691 INFO org.apache.hadoop.mapred.JvmManager: JVM : jvm_201209021529_0003_m_540175592 exited with exit code 137. Number of tasks it ran: 1

Outcomes