Spark Parquet Statistics(min/max) integration

后端 未结 3 1465
情深已故
情深已故 2021-01-02 13:38

I have been looking into how Spark stores statistics (min/max) in Parquet as well as how it uses the info for query optimization. I have got a few questions. First setup: Sp

3条回答
  •  南笙
    南笙 (楼主)
    2021-01-02 14:17

    This has been resolved in Spark-2.4.0 version. In here they have upgraded parquet version from 1.8.2 to 1.10.0.

    [SPARK-23972] Update Parquet from 1.8.2 to 1.10.0

    With these all column types, whether they are Int/String/Decimal will contain min/max statistics.

提交回复
热议问题