AnsweredAssumed Answered

How to create RDD by using external dataset which is saved on filesystem in MapR?

Question asked by ukolthur on Jun 26, 2018
Latest reply on Jun 29, 2018 by MichaelSegel

I am new to working in  Mapr cluster.

I have placed wordcount.txt in Filesystem

 

hadoop fs -put wordcount.txt  test.

 

When I query below,I am getting contents displayed.

hadoop fs -cat test/wordcount.txt

 

I want to create SparkRDD by loading it as external dataset and apply map transformations on it.

for the same I am using below

 

in HDFS I can use like below..

val a=sc.textFile("hdfs://test/wordcount.txt");

 

In MaprFS How I can access that file.what is the path?

 

Could you please tell me the command??.

Outcomes