AnsweredAssumed Answered

Drill + Evolving Parquet Schema

Question asked by john.humphreys on Jan 8, 2018
Latest reply on Jan 9, 2018 by john.humphreys

Let's say I have one parquet file per day, one row per host.  Each column is a metric for that host.

 

Hostmemory-usedcpunet-in-rate
A12345322873
B23456463872

 

Drill can scan over many days of files and use them fine.


What if I add and/or remove columns to the newer files over time though.  Can drill deal with looking up "cpu" when some of the files don't have CPU, etc?

Outcomes