AnsweredAssumed Answered

Input path does not exist: maprfs:

Question asked by mapr123 on Sep 3, 2013
Latest reply on Sep 3, 2013 by mapr123

I am trying to run a WordCount MR job (the one from Apache, not that comes with the MAPR install).
I added in my data files via NFS. I am getting the error "InvalidInputException: Input path does not exist: maprfs:/maprmountpoint/LARGEDATA/1992".
Not sure what the problem is? Any ideas?


[root LARGEDATA]#  hadoop jar wordcount.jar WordCount /maprmountpoint/LARGEDATA/* /maprmountpoint/LARGEDATA_OUTPUT/run1

13/09/03 14:09:27 INFO fs.JobTrackerWatcher: Current running JobTracker is:
13/09/03 14:09:27 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
13/09/03 14:09:27 WARN mapred.JobClient: No job jar file set.  User classes may not be found. See JobConf(Class) or JobConf#setJar(String).
13/09/03 14:09:27 INFO mapred.JobClient: Cleaning up the staging area maprfs:/var/mapr/cluster/mapred/jobTracker/staging/root/.staging/job_201308261533_0009
Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: maprfs:/maprmountpoint/LARGEDATA/1992
        at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.listStatus(
        at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits(
        at org.apache.hadoop.mapred.JobClient.writeNewSplits(
        at org.apache.hadoop.mapred.JobClient.writeSplits(
        at org.apache.hadoop.mapred.JobClient.access$500(
        at org.apache.hadoop.mapred.JobClient$
        at org.apache.hadoop.mapred.JobClient$
        at Method)
        at org.apache.hadoop.mapred.JobClient.submitJobInternal(
        at org.apache.hadoop.mapreduce.Job.submit(
        at org.apache.hadoop.mapreduce.Job.waitForCompletion(
        at WordCount.main(
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(
        at java.lang.reflect.Method.invoke(
        at org.apache.hadoop.util.RunJar.main(

pwd is "/maprmountpoint/"
drwxrwxrwx 14 root mapr 12 Sep  3 13:37 LARGEDATA
drwxrwxrwx  5 root mapr  3 Sep  3 13:39 LARGEDATA_OUTPUT

Input Files (all are root:mapr with 777)

[root LARGEDATA]# ls -l
total 33
drwxrwxrwx 2 root mapr 1367 Sep  3 13:35 1992

Output Files (all are root:mapr with 777)

[root LARGEDATA_OUTPUT]# ls -lrt
total 0
drwxrwxrwx 2 root mapr 0 Sep  3 13:39 run1