Spark, ML, StringIndexer: handling unseen labels

夕颜 2020-12-08 05:12

My goal is to build a multiclass classifier.

I have built a pipeline for feature extraction and it includes as a first step a StringIndexer transformer to map each c
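
To make the unseen-label problem concrete, here is a minimal plain-Python sketch of what a string indexer does at fit and transform time. It is only a model of the behavior, not Spark code: the function names `fit_string_indexer` and `transform` and the sample data are made up for illustration. The three modes mirror the `handleInvalid` parameter that recent Spark versions expose on `StringIndexer` ("error", "skip", "keep"). Spark itself emits the indices as doubles; plain ints are used here for brevity.

```python
from collections import Counter


def fit_string_indexer(labels):
    """Map each label to an index; the most frequent label gets index 0."""
    counts = Counter(labels)
    # Sort by descending frequency, then alphabetically so ties are deterministic.
    ordered = sorted(counts, key=lambda lab: (-counts[lab], lab))
    return {lab: idx for idx, lab in enumerate(ordered)}


def transform(mapping, labels, handle_invalid="error"):
    """Index labels; handle_invalid decides what happens to unseen ones."""
    indexed = []
    for lab in labels:
        if lab in mapping:
            indexed.append((lab, mapping[lab]))
        elif handle_invalid == "skip":
            continue  # drop the row that carries the unseen label
        elif handle_invalid == "keep":
            # put every unseen label into one extra bucket after the known ones
            indexed.append((lab, len(mapping)))
        else:
            raise ValueError(f"Unseen label: {lab}")
    return indexed


# Hypothetical data: "d" appears only at transform time.
train = ["a", "b", "a", "c", "a", "b"]
test = ["a", "d", "c"]

mapping = fit_string_indexer(train)
kept = transform(mapping, test, "keep")
skipped = transform(mapping, test, "skip")
print(mapping, kept, skipped)
```

With "keep", the unseen label survives as an extra index, so downstream stages must tolerate one more category than was seen during fitting. With "skip", the affected rows are silently dropped, which changes the row count of the output.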

5 Answers
  •  佛祖请我去吃肉
    2020-12-08 05:40

    In my case, I was running Spark ALS on a large data set and the data was not available on all partitions, so I had to cache() the data appropriately, and then it worked like a charm.
