发表新帖

发表新帖

sklearn.LabelEncoder with never seen before values

后端未结

关注

 12  1015

执笔经年 2020-11-27 10:37

If a sklearn.LabelEncoder has been fitted on a training set, it might break if it encounters new values when used on a test set.

The only solution I c

12条回答

一向 (楼主)

2020-11-27 11:17

I know two devs that are working on building wrappers around transformers and Sklearn pipelines. They have 2 robust encoder transformers (one dummy and one label encoders) that can handle unseen values. Here is the documentation to their skutil library. Search for skutil.preprocessing.OneHotCategoricalEncoder or skutil.preprocessing.SafeLabelEncoder. In their SafeLabelEncoder(), unseen values are auto encoded to 999999.

0 讨论(0)

查看其它12个回答
发布评论:

提交评论
- 加载中...

热议问题