Read files from S3 - Pyspark [duplicate]
问题 This question already has answers here : Spark Scala read csv file using s3a (1 answer) How to access s3a:// files from Apache Spark? (10 answers) S3A: fails while S3: works in Spark EMR (2 answers) Closed last year . I have been looking for a clear answer to this question all morning but couldn't find anything understandable. I just started to use pyspark (installed with pip) a bit ago and have a simple .py file reading data from local storage, doing some processing and writing results