AnsweredAssumed Answered

How optimize Spark/Yarn configuration ?

Question asked by ANIKADOS on Sep 12, 2017
Latest reply on Sep 25, 2017 by ANIKADOS

We have a cluster of 4 nodes with the characteristics above

https://i.stack.imgur.com/lWmef.png

Spark jobs make a lot of times in processing, how could we optimize this time, knowing that our jobs run from RStudio and we still have a lot of memory not utilized.

 

 

https://i.stack.imgur.com/Dms2c.png

Outcomes