I am building a multi-temperate data store where I want to use MapR as the "hot/warm" storage and S3 for the "cold/long term" storage and OLAP data analytics.
Do you have a write-up, white paper, or instructions on how to use MapR with S3 in this way?
What I'm asking for is a little more than just having both a MapR and S3 data store and query between the two. What I need is a system where data is rolled-off of MapR after a set time period (or disk usage warnings) onto S3. But, being able to query seamlessly between the two so that the queries are transparent to the user.
Thanks in advance