How can I find the size of an RDD

失恋的感觉 2020-12-04 19:45

I have an RDD[Row] that needs to be persisted to a third-party repository. But this third-party repository accepts a maximum of 5 MB in a single call.

S

5 answers
  •  陌清茗 (original poster)
    2020-12-04 20:33

    This depends on factors such as the serialization format, so there is no clear-cut answer. However, you could take a sample of the data, measure its serialized size, and extrapolate from there.
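
    A rough sketch of that sampling idea in plain Python, not a definitive implementation. The function names, the 0.8 safety factor, and the use of JSON as the serialization format are all assumptions for illustration; swap in whatever serialization the third-party call really uses. In PySpark the sample could come from `rdd.takeSample(False, 1000)` and the total from `rdd.count()`.

```python
import json

# The 5 MB per-call limit mentioned in the question.
MAX_CALL_BYTES = 5 * 1024 * 1024


def serialized_size(row):
    """Bytes one row occupies once serialized.

    JSON is only a stand-in here (assumption); use the real wire format.
    """
    return len(json.dumps(row).encode("utf-8"))


def average_row_bytes(sample_rows):
    """Mean serialized size of the sampled rows."""
    if not sample_rows:
        raise ValueError("sample_rows must not be empty")
    return sum(serialized_size(r) for r in sample_rows) / len(sample_rows)


def estimate_total_bytes(sample_rows, total_count):
    """Extrapolate the sample average to the whole data set."""
    return average_row_bytes(sample_rows) * total_count


def estimate_rows_per_call(sample_rows, max_bytes=MAX_CALL_BYTES, safety=0.8):
    """Estimate how many rows fit in one call.

    `safety` (hypothetical default 0.8) leaves headroom, because a sample
    average can underestimate the size of unusually large rows.
    """
    avg = average_row_bytes(sample_rows)
    return max(1, int(max_bytes * safety / avg))


if __name__ == "__main__":
    # Stand-in for rows sampled from the RDD and converted to dicts.
    sample = [{"id": i, "name": "x" * 10} for i in range(100)]
    print("rows per call:", estimate_rows_per_call(sample))
    print("estimated total bytes:", estimate_total_bytes(sample, 1_000_000))
```

    Since this is only an estimate, it is probably still worth checking the real size of each batch before sending it, in case a few rows are much larger than the sample suggests.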
