I am pulling Twitter data using Scala and save it into HDFS in json format. I created a data frame with:
val df = spark.read.schema(schema).option("multili