Submitted job intermittently freezing/failing

Question asked by bradford on Jul 3, 2012
A submitted job occasionally fails to launch on the JobTracker, with no external changes between runs. Sometimes it works; other times it freezes here:

    12/07/03 14:49:35 INFO fs.JobTrackerWatcher: Current running JobTracker is:
    2012-07-03 14:49:35,3851 ERROR Client fs/client/fileclient/cc/ Thread:  139951174268672 rpc 28.23 to, fid 2053.502.876780, upd 1, failed err 104

After this message, it goes no further, and the job does not show up in the JobTracker.

Sometimes I do not get this error message.

Control-C will not kill the job. I have to run jps and then kill -9 the RunJar process.
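
For anyone who hits the same hang repeatedly, the manual cleanup described above can be scripted. This is only a rough sketch in Python, assuming `jps` is on the PATH and prints one `<pid> <main class>` pair per line. The function names `find_runjar_pids` and `kill_hung_runjars` are made up for illustration and are not part of Hadoop or any other tool.

```python
import os
import signal
import subprocess


def find_runjar_pids(jps_output):
    """Return the PIDs of RunJar processes listed in `jps` output."""
    pids = []
    for line in jps_output.splitlines():
        parts = line.split()
        # Assumed format: "<pid> <main class>"; skip lines that do not match.
        if len(parts) >= 2 and parts[0].isdigit() and parts[1] == "RunJar":
            pids.append(int(parts[0]))
    return pids


def kill_hung_runjars():
    """Run jps and send SIGKILL to every RunJar process it reports."""
    output = subprocess.run(
        ["jps"], capture_output=True, text=True, check=True
    ).stdout
    pids = find_runjar_pids(output)
    for pid in pids:
        # Same effect as running `kill -9 <pid>` by hand.
        os.kill(pid, signal.SIGKILL)
    return pids
```

Calling `kill_hung_runjars()` kills every RunJar process that jps can see for the current user, including healthy job clients, so it is only safe when the hung job is the only one running from that account.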