Filter by Answers and Ideas

22 updates since last visit
mandoskippy
I went to go try playing with with librdkafka to use streams today.  My goal was to get something that used librdkafka working with the MapR librdkafka. While I am not good at any of this stuff, I did once stay at a Holiday Inn Express.    In summary, I found that I MapR likely used version 0.9.1 as a starting base for their librdkafka.  While… (Show more)
in Answers
mandoskippy
I am looking for a nice way to utilize netflow at massive scale... obviously MapR comes to mind... but there are things that have to be figured out... that said, I found a neat repo that seems to solve many of them:   GitHub - VerizonDigital/vflow: Enterprise Network Flow Collector (IPFIX, sFlow, Netflow)    It's designed to by multi-master,… (Show more)
in Answers
nshanmugam
I am trying to install MapR on a single node VM for basic development and testing. Have created a seperate partition of 500Gb. The partition can be accessed and is unmounted and even mkfs.ext3 runs smoothly on that. But during installation i am not able to select that partition, it's grayed out.    What are the other conditions for a disk to be… (Show more)
in Answers
john.humphreys
We're in the process of migrating a legacy batch-based system (Spark) to a streaming system (also Spark). The old system digests very large data files to a columnar database using Sqoop. The new system should digest the same data to OpenTSDB over MapR-DB. We need to shut down our legacy database and replace it with OpenTSDB as our first step… (Show more)
in Answers
john.humphreys
Hey,   We're building a system using OpenTSDB on top of MapR-DB and it is working quite well.   Before getting too invested though, we wanted to understand our options.  We're keen to use MapR-DB for its speed and simplicity.  We're also pretty excited about Apache Phoenix (it would make the target system much more powerful and flexible).  Will… (Show more)
in Answers
evckumar1
Hello Team,   We have a new requirement that where everyone asking us to configure to kill user jobs (Yarn/Spark/Hive etc) which runs > x number of hours. Is there a parameter that we can setup in MapR? Monitoring cluster 24*7 and killing manually is lot of effort. Apparently the users are claiming that this possible in Horton works using Ambari.… (Show more)
in Answers
Steve.HL.Wong
My question is what is the outcome when 2 clients try to connect to same VIP pool. According to  MapR documentmentation: I just wonder why we need 2 VIP Pools. What if Client 2 try to connect to VIP Pool A?
in Answers
Karthee
Hi All, In my three node cluster, i have optimized all the required parameters for the performance. But this is not much helping in my case, All our hive tables are created with parquet format, when my team tries to load from external table to internal table, please find the script below,   ksh -c 'hadoop fs -rm -R… (Show more)
in Answers
karthikSpark
hi all, I'm trying to read kafka topic from a different host using spark streaming application. I have two hosts A and B. A has zookeeper and kafka(0.9.0.1) installed. B has spark installed on it. Now when im trying to read the kafka topic from A through spark streaming from B and persist the data in to B. Here is the issue, when i run the… (Show more)
in Answers
sumathi
Hi all,   I would like to know if the chunk size in Mapr is dynamic or static   Eg: if the set the chunk size to 64 gb, and file size is of 50gb wat will be the chunk
in Answers
dzndrx
Hi this is currently my setup, having a 2 node cluster bare metal servers. We started 2 node in order to perform the adding a new node to the cluster resulting to a three node cluster.   But before doing that we need to have a healthy 2 node cluster. Unfortunately hivemeta service is producing down alarm.   Here are the specs of two nodes 24… (Show more)
in Answers
slimbaltagi
Both Kafka Connect and StreamSets Data Collector are open source Apache licensed projects that can help you with getting event streams in and out of Apache Kafka/MapR Streams and build data pipelines.  Both Kafka Connect and StreamSets Data Collector have advantages and disadvantages. As I did not find anywhere a comparison of Kafka Connect versus… (Show more)
in Answers
dzndrx
Hi, My license just got expired but I cannot make my NFS go up (Even 1 instance).   How to register it back to Community edition to enable even 1 NFS gateway.   I've tried registering again my cluster on my account and choosing the Community edition license, but it wont generate any license it just loads and after that no license is generated… (Show more)
in Answers
Load more items