AnsweredAssumed Answered

How the MapR balance data within the whole cluster?

Question asked by zoglee on Sep 21, 2011
Latest reply on Sep 22, 2011 by zoglee
I setup a 6-nodes cluster for 1 week, and I find the data seems not well balanced among all disks in the cluster.

Take today's data for instance:

> master: **18%** of 652.0GB in use for 1
> MapR-FS disk(s)  (used: **120.0 GB**)
>
> slave1: 9% of 942.0GB in use for 1
> MapR-FS disk(s)   (used: 88.0 GB)
>
> slave2: 8% of 946.0GB in use for 1
> MapR-FS disk(s)   (used: 71.0 GB)
>
> slave3: 8% of 946.0GB in use for 1
> MapR-FS disk(s)   (used: 75.0 GB)
>
> slave4: 7% of 952.0GB in use for 1
> MapR-FS disk(s)   (used: 71.0 GB)
>
> slave5: 8% of 942.0GB in use for 1
> MapR-FS disk(s)   (used: 73.0 GB)

Data seems prefer to the master-node, but why?

And what is the strategy for the data-balancing?

How can I balance data in my cluster?

Outcomes