get filename and file modification/creation time as (key, value) pair into RDD using pyspark

前端 未结 0 997
野趣味
野趣味 2020-12-16 19:59

I have folders with many many files (e.g. over 100k), some files small (less than 1kb) and some files big (e.g. several MBs).

I would like to use pyspark and scan all

相关标签:
回答
  • 消灭零回复
提交回复
热议问题