AWS Marketplace Deployment

Document created by wochanda Employee on Aug 22, 2017Last modified by maprcommunity on Aug 29, 2017
Version 6Show Document
  • View in full screen mode

Introduction

The fastest and easiest way to get started with MapR on AWS is through the AWS Marketplace.  The AWS Marketplace also allows you to enable MapR Enterprise features like mirroring and replication while paying by the hour through your AWS account.  MapR has four products listed in the AWS Marketplace in order to accommodate different capability and billing preferences.

ProductComponents IncludedLicensed Hourly?Customizable?
MapR CDP - Analytics Bundle

MapR-XD, Apache Spark, Apache Drill, Apache Hadoop

Enterprise Features Enabled, Includes MapR Support

YesNo
MapR CDP - Converged Bundle

MapR-XD, MapR-DB, MapR-ES, Apache Spark, Apache Drill, Apache Hadoop

Enterprise Features Enabled, Includes MapR Support

YesYes
MapR CDP - Community Edition

MapR-XD, MapR-DB, MapR-ES, Apache Spark, Apache Drill, Apache Hadoop

Enterprise Features Disabled, No MapR Support

NoYes
MapR CDP - BYOL

MapR-XD, MapR-DB, MapR-ES, Apache Spark, Apache Drill, Apache Hadoop

Enterprise Features and Support Available Separately

NoYes

AWS Permissions Required

Deploying software through the AWS Marketplace requires full administrative control over your AWS account, including the ability to create and manage user roles, networks, SSH keys, and VM instances.  If you are trying to deploy MapR from an account with restricted privileges, it may be more appropriate to try one of the pre-defined examples.

 

Deployment Steps

Choose A Marketplace Product

  1. Click one of the product links in the table above, or visit the MapR AWS Seller page.  
  2. Select your preferred region in the 'For Region' dropdown, then click 'Continue'
  3. If prompted in the top-right corner of the page, click 'I Accept Software Terms', then 'Return to Launch Page'.
  4. Select preferred MapR Version, Region.
  5. Click 'Launch with CloudFormation Console'.
  6. From 'Create Stack' page, click 'Next'.

 

Configure Your Stack

(Selections here will differ between examples.)

  1. Stack name - Unique name for this deployment within your AWS account.  You may need this later to modify or delete your cluster.
  2. AWS Infrastructure Configuration
    1. keyName - AWS keypair you'll be using to SSH into your deployed cluster. 
      1. If you don't have one, create one from AWS > Services > EC2 > Key Pairs.  Hold on to your private key.
  3. MapR Cluster Configuration
    1. clusterName - Use a unique, DNS-esque identifier for your cluster.
    2. clusterAdminPassword - This will be used along with the 'mapr' user to log into your cluster management console.
    3. MEP - This is the version of the 'MapR Ecosystem Pack' that will be deployed.
    4. provisioningTemplate - This specifies the specific Apache open-source components that will be deployed on your cluster.
      1. Converged-Platform - Deploys Apache Hadoop, Hive, Spark, and Drill along with the core MapR Data Services - MapR-XD, MapR-DB, and MapR-ES.
      2. Data-Lake - Deploys Apache Hadoop, Hive, Spark, and Drill along with MapR-XD.
      3. Operational-Analytics - Deploys Apache Drill and MapR-DB.
      4. Real-Time-Analytics - Deploys Apache Spark with MapR-ES.
      5. Custom-Configuration - Allows you to fully customize the components to install at a later stage (see last step).
  4. Node Configuration - Specify the instance type and storage configuration.  Some tips.
    1. d2 and i3 instances are optimized for local storage.  If these are selected we will ignore your disk choices and instead use the disks that come with the instance.
    2. All other instances will use only EBS storage, so you should specify type/amount.
  5. Network, Access, Security
    1. useExistingSubnet - Specify subnet id that you either created manually or using the aws_cf_maprcluster_vpc.yml template.
    2. installOpenVPN - If your VPC/subnet are not accessible to the wide internet, you can use OpenVPN to connect your computer to the MapR subnet to access it.

 

Create Your Stack

Once you've configured your stack, click 'Next' at the next couple of pages.

When asked, check the box next to 'I acknowledge that AWS CloudFormation might create IAM resources with custom names' and click 'Create'.

 

Once you click 'create', you can view progress on the next page.  Make sure you filter for 'In Progress'.

 

(Optional) Customize Your Cluster

If for provisioningTemplate you selected 'Custom-Configuration' you can now fully customize your MapR cluster before installation.

 

  1. Find your stack in the CloudFormation Stack list.  Make sure it is in status of 'CREATE_COMPLETE'.
  2. Select the your stack, and in the bottom panel dropdown select 'Outputs'.
  3. Find your 'MaprInstallerUrl' and try to browse to it.
    1. If you can't access the URL, you may need to VPN into it.  If you selected OpenVPN=true when configuring your stack, you should have other outputs like 'OpenVPNUrl' to help you VPN into your cluster network.
  4. Sign In to your cluster using 'mapr' user and the password you configured.
  5. Click 'Next' and configure your cluster as you would for a non-cloud installation.
  6. Once complete, you'll see a page of URL endpoints, including a link to MCS for managing your cluster.

 

Access Your Cluster

  1. Find your stack in the CloudFormation Stack list.  Make sure it is in status of 'CREATE_COMPLETE'.
  2. Select the your stack, and in the bottom panel dropdown select 'Outputs'.
  3. Find your 'MaprInstallerUrl' and try to browse to it.
    1. If you can't access the URL, you may need to VPN into it.  If you selected OpenVPN=true when configuring your stack, you should have other outputs like 'OpenVPNUrl' to help you VPN into your cluster network.
  4. Sign In to your cluster using 'mapr' user and the password you configured.
  5. You will see a page containing a list of URL endpoints, including a link to MCS for managing your cluster.

Attachments

    Outcomes