AWS Marketplace Deployment

Document created by wochanda on Aug 22, 2017. Last modified by nsen on Oct 17, 2017. Version 7.

Introduction

The fastest and easiest way to get started with MapR on AWS is through the AWS Marketplace.  The AWS Marketplace also allows you to enable MapR Enterprise features like mirroring and replication while paying by the hour through your AWS account.  MapR has four products listed in the AWS Marketplace in order to accommodate different capability and billing preferences.

Product | Components Included | Licensed Hourly? | Customizable?
MapR CDP - Analytics Bundle | MapR-XD, Apache Spark, Apache Drill, Apache Hadoop. Enterprise features enabled; includes MapR Support. | Yes | No
MapR CDP - Converged Bundle | MapR-XD, MapR-DB, MapR-ES, Apache Spark, Apache Drill, Apache Hadoop. Enterprise features enabled; includes MapR Support. | Yes | Yes
MapR CDP - Community Edition | MapR-XD, MapR-DB, MapR-ES, Apache Spark, Apache Drill, Apache Hadoop. Enterprise features disabled; no MapR Support. | No | Yes
MapR CDP - BYOL | MapR-XD, MapR-DB, MapR-ES, Apache Spark, Apache Drill, Apache Hadoop. Enterprise features and support available separately. | No | Yes

AWS Permissions Required

Deploying software through the AWS Marketplace requires full administrative control over your AWS account, including the ability to create and manage user roles, networks, SSH keys, and VM instances.  If you are trying to deploy MapR from an account with restricted privileges, it may be more appropriate to try one of the pre-defined examples.

 

Deployment Steps

Choose A Marketplace Product

  1. Click one of the product links in the table above, or visit the MapR AWS Seller page.  
  2. Select your preferred region in the 'For Region' dropdown, then click 'Continue'.
  3. If prompted in the top-right corner of the page, click 'I Accept Software Terms', then 'Return to Launch Page'.
  4. Select your preferred MapR version and region.
  5. Click 'Launch with CloudFormation Console'.
  6. On the 'Create Stack' page, click 'Next'.

 

Configure Your Stack

(Selections here will differ between examples.)

  1. Stack name - Unique name for this deployment in the AWS region within your account.  You may need this later to modify or delete your cluster.
  2. MapR Cluster Configuration
    1. clusterName - Use a DNS-esque identifier for your cluster.
    2. clusterAdminPassword - This will be used along with the 'mapr' user to log into your cluster management console.
    3. MEP - This is the version of the 'MapR Ecosystem Pack' that will be deployed.
    4. provisioningTemplate - This selects which Apache open-source components will be deployed on your cluster.
      1. Converged-Platform - Deploys Apache Hadoop, Hive, Spark, and Drill along with the core MapR Data Services - MapR-XD, MapR-DB, and MapR-ES.
      2. Data-Lake - Deploys Apache Hadoop, Hive, Spark, and Drill along with MapR-XD.
      3. Operational-Analytics - Deploys Apache Drill and MapR-DB.
      4. Real-Time-Analytics - Deploys Apache Spark with MapR-ES.
      5. Custom-Configuration - Allows you to fully customize the components to install at a later stage (see last step).
    5. nodeCount - The number of nodes you want in your cluster.
  3. Node Configuration - Specify the instance type and storage configuration.  Some tips:
    1. d2 and i3 instances are optimized for local storage.  If one of these is selected, your disk choices are ignored and the disks that come with the instance are used instead. Keep in mind that data on local instance storage does not persist when an instance is stopped or terminated.
    2. All other instances will use only EBS storage, so you should specify the volume type and size.
  4. AWS Infrastructure Configuration
    1. keyName - AWS keypair you'll be using to SSH into your deployed cluster. 
      1. If you don't have one, create one from AWS > Services > EC2 > Key Pairs.  Hold on to your private key.
    2. useSubnet - Specify the ID of a subnet that you created either manually or with the aws_cf_maprcluster_vpc.yml template. The subnet should be able to communicate with the Internet. Leave this field empty to let the template create a VPC and subnet for you; this is recommended if you do not have an existing subnet available.
    3. securityGroups - If you specified a subnet in the field above, provide the list of security groups required for the nodes in that subnet to communicate with each other and with the Internet. Note: be sure to include the default security group for the VPC in your list of security groups.
    4. assignPublicIP - Create the nodes with public IP addresses.
    5. publicAccessCIDR - The CIDR block from which access to the VPC is allowed. If you use 0.0.0.0/0 as your CIDR, your subnet is accessible to everyone on the Internet. It is recommended to restrict it to your corporate IP range. This is used only if you do not specify a value for useSubnet.
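
The settings above can also be collected and sanity-checked in a script before they are entered. The following is a minimal Python sketch, not part of the Marketplace templates. The helper name build_parameters is hypothetical, and the validation rules are assumptions: a "DNS-esque" name is read as dot-separated DNS labels, securityGroups is assumed to be passed as a comma-separated string, and empty values are assumed to fall back to defaults defined by the template. Only some of the parameters listed above are covered.

```python
import ipaddress
import re

# One DNS label: letters, digits, and inner hyphens, at most 63 characters.
_LABEL = r"[A-Za-z0-9](?:[A-Za-z0-9-]{0,61}[A-Za-z0-9])?"
# Assumption: a DNS-like cluster name is a series of dot-separated labels.
_CLUSTER_NAME = re.compile(rf"{_LABEL}(?:\.{_LABEL})*")

# Template names as listed above; other Marketplace products may offer a different set.
TEMPLATES = {
    "Converged-Platform",
    "Data-Lake",
    "Operational-Analytics",
    "Real-Time-Analytics",
    "Custom-Configuration",
}


def build_parameters(cluster_name, admin_password, mep, provisioning_template,
                     node_count, key_name, use_subnet="", security_groups=(),
                     public_access_cidr=""):
    """Return stack parameters as a list of ParameterKey/ParameterValue dicts."""
    if not _CLUSTER_NAME.fullmatch(cluster_name):
        raise ValueError(f"clusterName is not DNS-like: {cluster_name!r}")
    if provisioning_template not in TEMPLATES:
        raise ValueError(f"unknown provisioningTemplate: {provisioning_template!r}")
    if node_count < 1:
        raise ValueError("nodeCount must be at least 1")
    if use_subnet:
        # An existing subnet needs its security groups spelled out.
        if not security_groups:
            raise ValueError("securityGroups is required when useSubnet is set")
    else:
        # publicAccessCIDR applies only when the template creates the VPC.
        ipaddress.ip_network(public_access_cidr)  # raises ValueError if malformed

    values = {
        "clusterName": cluster_name,
        "clusterAdminPassword": admin_password,
        "MEP": mep,
        "provisioningTemplate": provisioning_template,
        "nodeCount": str(node_count),  # parameter values are sent as strings
        "keyName": key_name,
        "useSubnet": use_subnet,
        "securityGroups": ",".join(security_groups),
        "publicAccessCIDR": public_access_cidr,
    }
    # Empty values are left out so the template's own defaults apply.
    return [{"ParameterKey": k, "ParameterValue": v} for k, v in values.items() if v]
```

The returned list uses the ParameterKey/ParameterValue shape that CloudFormation expects for stack parameters, so it could be passed to an API call or simply used as a checklist while filling in the console form.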

 

Create Your Stack

Once you've configured your stack, click 'Next' on the next couple of pages.

When asked, check the box next to 'I acknowledge that AWS CloudFormation might create IAM resources with custom names' and click 'Create'.

 

Once you click 'Create', you can view progress on the next page.  Make sure you filter for 'In Progress'.
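
Progress can also be followed from a script instead of the console. The sketch below is one possible polling loop in Python, offered as an illustration only. The function name wait_for_stack and the get_status callable are hypothetical; in practice get_status might wrap a call such as boto3's describe_stacks and return the StackStatus of the stack, but that wiring is not shown here.

```python
import time


def wait_for_stack(get_status, poll_seconds=30, timeout_seconds=3600, sleep=time.sleep):
    """Poll get_status() until stack creation completes, fails, or times out.

    get_status is any zero-argument callable returning the current
    CloudFormation stack status string (hypothetical; supplied by the caller).
    """
    waited = 0
    while True:
        status = get_status()
        if status == "CREATE_COMPLETE":
            return status
        if status != "CREATE_IN_PROGRESS":
            # For example CREATE_FAILED or ROLLBACK_IN_PROGRESS.
            raise RuntimeError(f"stack creation ended in status {status}")
        if waited >= timeout_seconds:
            raise TimeoutError(f"stack still in progress after {waited} seconds")
        sleep(poll_seconds)
        waited += poll_seconds
```

Passing the sleep function as a parameter keeps the loop easy to exercise with canned statuses before pointing it at a real account.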

 

(Optional) Customize Your Cluster

If you selected 'Custom-Configuration' for provisioningTemplate, you can now fully customize your MapR cluster before installation.

 

  1. Find your stack in the CloudFormation Stack list.  Make sure its status is 'CREATE_COMPLETE'.
  2. Select your stack, and in the bottom panel dropdown select 'Outputs'.
  3. Find your 'MaprInstallerUrl' and try to browse to it.
    1. If you can't access the URL, you may need to VPN into the cluster network.  If you selected OpenVPN=true when configuring your stack, you should have other outputs like 'OpenVPNUrl' to help you VPN into your cluster network.
  4. Sign in to your cluster as the 'mapr' user with the password you configured.
  5. Click 'Next' and configure your cluster as you would for a non-cloud installation.
  6. Once complete, you'll see a page of URL endpoints, including a link to MCS for managing your cluster.

 

Access Your Cluster

  1. Find your stack in the CloudFormation Stack list.  Make sure its status is 'CREATE_COMPLETE'.
  2. Select your stack, and in the bottom panel dropdown select 'Outputs'.
  3. Find your 'MaprInstallerUrl' and try to browse to it.
    1. If you can't access the URL, you may need to VPN into the cluster network.  If you selected OpenVPN=true when configuring your stack, you should have other outputs like 'OpenVPNUrl' to help you VPN into your cluster network.
  4. Sign in to your cluster as the 'mapr' user with the password you configured.
  5. You will see a page containing a list of URL endpoints, including a link to MCS for managing your cluster.
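
The lookup in the steps above can also be done on a stack description retrieved through the CloudFormation API. The following Python sketch assumes a dictionary in the shape returned by the DescribeStacks call (a StackStatus field and an Outputs list of OutputKey/OutputValue pairs). The function names and the example URL are hypothetical.

```python
def stack_outputs(stack):
    """Map each output key of a stack description to its value."""
    return {o["OutputKey"]: o["OutputValue"] for o in stack.get("Outputs", [])}


def installer_url(stack):
    """Return the 'MaprInstallerUrl' output of a fully created stack."""
    status = stack.get("StackStatus")
    if status != "CREATE_COMPLETE":
        raise RuntimeError(f"stack is not ready yet (status: {status})")
    outputs = stack_outputs(stack)
    if "MaprInstallerUrl" not in outputs:
        raise KeyError("stack has no 'MaprInstallerUrl' output")
    return outputs["MaprInstallerUrl"]


# Example stack description with a made-up URL, for illustration only.
example_stack = {
    "StackStatus": "CREATE_COMPLETE",
    "Outputs": [
        {"OutputKey": "MaprInstallerUrl", "OutputValue": "https://installer.example:9443"},
    ],
}
print(installer_url(example_stack))
```

If the URL is not reachable from your network, the same output dictionary is where an 'OpenVPNUrl' entry would appear when OpenVPN=true was selected.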
