I\'m a newbie in Pyspark programming. I need some help. I have a dataset with a categorical feature and some associated numerical values with it. I would like to vectorize the c