问题
What is the purpose of intervals field in GranularitySpec and importantly, how does it impact the data load or ingestion ?
For e.g. If I have data from 2014-01-01 to 2017-09-30
in a file and I specify intervals value as "intervals" : ["2016-01-01/2017-12-31"]
. What would happen during data load in this scenario ?
Would really appreciate any help with this understanding. Thanks.
The document just says as below without much info on its impact:
intervals: A list of intervals for the raw data being ingested.
回答1:
The intervals field acts like a filter. If you have your intervals in the granularitySpec set to 2016-01-01/2017-12-31, then any data outside of that range will not be loaded.
来源:https://stackoverflow.com/questions/46948003/how-intervals-field-in-granularityspec-impact-data-ingestion