Hao Zhu
Author: Hao Zhu
Original Publication Date: June 5, 2015
Environment: Spark 1.3.1
Goal: This article introduces the behavior of Hive outer join.
Solution: From Hive Outer Join Behavior, here are the definitions of Preserved Row table, Null Supplying table, During Join predicate and After Join predicate (Where predicate). Take a left outer…
in Answers
rbukarev
1. I noticed that the example for using MapR-DB OJAI Connector for Spark discusses the Streaming mode, but doesn't touch Structured Streaming. With the latter being announced production-ready now, is there any way to use that Connector, too? 2. The standard doco doesn't mention that, but I guess the method "MapRDBSpark.newDocument(x)" expects…
in Answers
walidaoudi
My OS environment: Windows 10, mapr-client-6.0.1 (installed and configured) + MapR-Sandbox-For-Hadoop-6.0.1. I have tested hadoop.spoofed.user with user "mapr" and with user "root", and I am facing the same issue. When I run hadoop fs -ls /mapr (the command is "working", however showing the content of /mapr/demo.mapr.com on the server…
in Answers
sn55179
Hi, I'm trying to load MapR-DB JSON in Spark and call createOrReplaceTempView on top of it. Is it possible? val json = sc.loadFromMapRDB("/usr/hdfs/docdb/data/emp_snapshot") The above statement returns val json: MapRDBTableScanRDD[OJAIDocument]. But I think I need to derive a DataFrame to create a temp view. How can…
in Answers
terryhealy
Running V6.0.1 / MEP 5.0.0 IntelliJ on a system with MapR Client and Spark installed. I'm trying to write a DataFrame to a MapR-DB JSON file. I'm reading a .csv file and filtering some fields and adding an _id field. In the code below I can't find a reference to saveToMapRDB() for the DataFrame, despite trying all sorts of recommended…
in Answers
terryhealy
We recently upgraded our cluster to 6.0.1 / MEP 5.0.0 and are trying to modernize storage of our primary datasets. One of these is Netflow data, which is stored in a MapR-DB JSON table. (Bad choice??) I'm able to do very crude searches using mapr dbshell, but previously used Impala. All indications are that the cool kids are all using Drill; but…
in Answers
john.humphreys
I've been pumping billions of records into a single topic in a MapR-Stream daily for close to a year now, and I'm noticing that the region count keeps going up and up. It has 5,584 regions now. I regularly call the purge endpoint and have a 4-day TTL on messages. The size of the stream seems to stay fairly constant at 700GB physical…
in Answers
thomaztony
Can TIBCO Spotfire be integrated with MapR Hadoop distribution on AWS?
in Answers
PETER.EDIKE
Hello Everyone, I am trying to implement a simple read-and-print of messages from a MapR-Streams topic with Spark Structured Streaming, using the following code in spark-shell: import org.apache.spark.sql.SQLContext import org.apache.spark.rdd.RDD import org.apache.spark.SparkConf import org.apache.spark.sql.SparkSession import…
in Answers
gesgeorge
Hi, I'm looking for any existing tools or open source projects that could help with moving data from MongoDB collections to MapR-DB tables. I can certainly write code to do this, but I wanted to explore any out-of-the-box solutions that might already exist and/or hear from anyone who has migrated data from Mongo to MapR-DB JSON tables. One…
in Answers
SubashKunjupillai
Hi, We have MapR 5.2.1 installed at 20-odd different customer sites. Since GDPR took effect, we have been in the process of enabling security at the MapR layer and looking at other vulnerabilities within our product. In the course of that we found that we have been creating a 'mapr' user to which sudo permission is granted during installation (Not…
in Answers
Velumani
Hi, I am trying to set up MapR 6.0.1 in AWS using CloudFormation with reference to the blog 9 Steps to Deploying the MapR Converged Data Platform on AWS | MapR. While creating the Stack, I used the default template URL (https://s3.amazonaws.com/awsmp-fulfillment-cf-templates-prod/c4e41a39-a53d-4f83-84a0-760f94bc31b2.8d0a9abb-2eca-4aa1-b7a… ) and…
in Answers