Splitting a dictionary in a Pyspark dataframe into individual columns
问题 I have a dataframe (in Pyspark) that has one of the row values as a dictionary: df.show() And it looks like: +----+---+-----------------------------+ |name|age|info | +----+---+-----------------------------+ |rob |26 |{color: red, car: volkswagen}| |evan|25 |{color: blue, car: mazda} | +----+---+-----------------------------+ Based on the comments to give more: df.printSchema() The types are strings root |-- name: string (nullable = true) |-- age: string (nullable = true) |-- dict: string