we are just starting our Big Data journey.. and we decided to use MapR Enterprise in Production. starting with 3 nodes with 24TB each node.
I am task to import data from Oracle and MySQL to Hive. what is the best approach you will recommend?
Hello ronnie a. You may find this discussion helpful How to incrementally load data from oracle DB to MaprDB?. Not exactly the same source and destination combination, but should give you a sense of direction I'd think.
it doesn't answer my question, which one you prefer to use sqoop1 or sqoop2?
do you recommend using sqoop2?
Well, that's hard for me to suggest as the preference would depend on what it is that you are trying to do. I did a quick search though and found this in the CDH documentation. As per them, Sqoop2 is getting deprecated (probably in their distro).
Hope that helps a little.
Retrieving data ...