AnsweredAssumed Answered

Usage of Distributed Cache

Question asked by sxsnyc on Mar 29, 2012
Latest reply on Mar 30, 2012 by sxsnyc
We're migrating from Cloudera 20.2cdhu2 to MapR v. 1.2.3.12961.GA and we're using the DistributedCache.

In our mapper's setUp method, we use the file to initialize a reference data class.  The path the we use is relative to the hdfs user.

How does MapR initialize the Distributed Cache and what does the URI look like if we are referencing a file in HDFS?

    java.io.FileNotFoundException: Requested file path/path/a_reference_file.tsv does not exist
     at com.mapr.fs.MapRFileSystem.getMapRFileStatus(MapRFileSystem.java:497)
     at com.mapr.fs.MapRFileSystem.getFileStatus(MapRFileSystem.java:504)
     at org.apache.hadoop.filecache.TaskDistributedCacheManager.setupCache(TaskDistributedCacheManager.java:180)
     at org.apache.hadoop.mapred.TaskTracker$4.run(TaskTracker.java:1560)
     at java.security.AccessController.doPrivileged(Native Method)
     at javax.security.auth.Subject.doAs(Subject.java:416)
     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1109)

Thanks for your help.

Outcomes