We are starting to explore MapR as our "Data Lake" cluster.
How should each developer's workstation be configured? We have Windows workstations.
Should they just have the MapR client installed, with its conf files pointing to a MapR dev cluster where the jobs run? The issue I see with this approach is that if the dev cluster is down, developer productivity is lost, because they can't run their jobs.
Should we have the MapR Sandbox VM installed on every developer workstation, which would allow them to be completely independent? The issue with this approach is that each workstation would need 16 GB of RAM (8 GB of it for the VM itself).
What is the recommended approach?