I have another thread going here WIll Phoenix be able to run on MapR-DB? which has basically evolved into discussing OpenTSDB vs Drill (or using both which looks like the likely solution).
In this case, I'm interested in knowing the optimal data layout and backing store for querying large amounts of time series data with drill assuming I'm moving forward with it.
- Is it better to store data in MapR-DB or in MapR-FS as parquet files or in MapR-FS as JSON files? Is there another, better option I failed to mention?
- Why is this the better option?
- What data layout should I use? E.g. if MapR-DB, how should my JSON look? Does how I write my parquet file matter (in order, timetsamp in first column or host name in first column, etc)?