I build a dataframe then write it out to s3 like so:
df.write.mode("overwrite").option("compression"