AnsweredAssumed Answered

Cascading on MapR 4.0.0-FCS

Question asked by sumitsu on Sep 11, 2014
I am attempting to run a Cascading flow on a 3-node cluster of MapR 4.0.0-FCS running on CentOS 6.5.

The first problem I encountered is that I seem to be unable to install mapr-cascading via the normal procedure [outlined here][1]: http://doc.mapr.com/display/MapR/Cascading; `yum install` is unable to find any packages matching `mapr-cascading`, though I have the following repositories configured:

    [maprtech]
    name=MapR Technologies
    baseurl=http://package.mapr.com/releases/v4.0.0-FCS/redhat
    enabled=1
    gpgcheck=0
    protect=1
    
    [maprecosystem]
    name=MapR Technologies
    baseurl=http://package.mapr.com/releases/ecosystem/redhat
    enabled=1
    gpgcheck=0
    protect=1

I was able to install the package manually by downloading the RPM from here: [http://archive.mapr.com/releases/ecosystem-all/redhat/][2], though the latest version I was able to find was Cascading 2.1.6; is that the most recent version available?

The second problem encountered -- and one which is preventing the flow from executing -- is that attempts to run the flow stall out with a stream of continuing connection errors like the following:

    14/09/11 10:35:52 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:35:53 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:35:54 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:35:55 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 3 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:35:56 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:35:57 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 5 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:35:58 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 6 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:35:59 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 7 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:36:00 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 8 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
    14/09/11 10:36:01 INFO ipc.Client: Retrying connect to server: mapr1/10.10.16.43:8032. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)

Because I've been unable to get the `hadoop` command execution which runs the Cascade flow to find the Cascading libraries installed on MapR, tests undertaken thus far have used a "fat" JAR consisting of both my Cascading code and Cascading library dependencies (JAR-ed together by Maven).  I have tried both with `cascading-hadoop2-mr1` 2.5.6 and with `cascading-hadoop` 2.1.6, with the same results.

Could anyone offer any insight as to what might be the problem?  Thanks.


  [1]: http://doc.mapr.com/display/MapR/Cascading
  [2]: http://archive.mapr.com/releases/ecosystem-all/redhat/

Outcomes