why cldb and ZooKeeper should be run separate control nodes in large cluster?
Hello Punith VP
In a large cluster (100 nodes or more) create CLDB-only nodes to ensure high performance. This configuration also provides additional control over the placement of the CLDB data, for load balancing, fault tolerance, or high availability (HA). Setting up CLDB-only nodes involves restricting the CLDB volume to its own topology and making sure all other volumes are on a separate topology. Because both the CLDB-only path and the non-CLDB path are children of the root topology path, new non-CLDB volumes are not guaranteed to keep off the CLDB-only nodes.
Also when the TaskTracker service is running on a node that is also running the CLDB or ZooKeeper services, consider reducing the number of task slots that this node's instance of the TaskTracker service provides.
Please Check the below Link for more info
Isolating CLDB Nodes - MapR 5.0 Documentation - doc.mapr.com
Thanks and Best Regards
Thank you Mina! Punith VP let us know if this helps
Retrieving data ...