I have a MR job which is timing out on some nodes with below error. This is causing in Reduce Phase any Idea why this issue is repeating on only few nodes.
AttemptID:attempt_1457619643458_2273_r_000064_0 Timed out after 600 secs
Container killed by the ApplicationMaster. Container killed on request. Exit
code is 143 Container exited with a non-zero exit code 143