AnsweredAssumed Answered

Submitted job intermittently freezing/failing

Question asked by bradford on Jul 3, 2012
Latest reply on Aug 10, 2012 by bradford
Greetings,

A submitted job occasionally fails to launch on jobtracker with no external changes. Sometimes it works, sometimes it freezes here:


    12/07/03 14:49:35 INFO fs.JobTrackerWatcher: Current running JobTracker is: wk1.sf.drawntoscale.com/192.168.10.2:9001
    2012-07-03 14:49:35,3851 ERROR Client fs/client/fileclient/cc/client.cc:3104 Thread:  139951174268672 rpc 28.23 to 192.168.10.2:5660, fid 2053.502.876780, upd 1, failed err 104

After this message, it goes no further. Job does not show up in jobtracker.

Sometimes I do not get this error message.


Control-C will not kill the job. I have to do jps and kill-9 the RunJar.

Outcomes