Products & Services
MapR Book Club
to create and rate content, and to follow, bookmark, and share content with other members.
Install Spark for M7(4.0.2) on Amazon EMR
Question asked by
on Oct 7, 2015
on Dec 7, 2015 by davidtucker
Show 0 Likes
How can I install or use Spark for M7(4.0.2) on Amazon EMR ?
No one else has this question
Mark as assumed answered
This content has been marked as final.
Show 1 comment
(Required, will not be published)
Dec 7, 2015 4:32 PM
The mapr-spark packages will work just fine in EMR. You can install them yourself, or use the bootstrap action at s3://maprtech-emr/scripts/mapr-spark-bootstrap.sh . Adding that action as an additional step to your EMR deployment will install the latest supported Spark (currently 1.4.1) on the cluster.
The default mode for the mapr-spark package is Spark on YARN. The simplest way to launch a Spark job would be to log in to the EMR Master node and simply run the job (or invoke /opt/mapr/spark/spark-*/bin/spark-shell) from that server.
Show 0 Likes
Retrieving data ...
Data Science Refinery Library (blogs)
Is it possible to run an Oozie Spark Action without specifying inputDir & outputDir
Can we use Hunk with MapR using NFS for files?
Resource Manager/Scheduler Address
No NFS license community 5.2.2