I have a DataFrame column that holds the locations of S3 objects, and I want to compute the hash of each object using PySpark's rdd.map. When I run the code below to get the h