Put customized functions in Sklearn pipeline

礼貌的吻别 2020-12-30 14:52

In my classification scheme, there are several steps including:

SMOTE (Synthetic Minority Over-sampling Technique)
Fisher criteria for feature selecti

2 answers

孤独总比滥情好 (original poster)

2020-12-30 15:20
scikit-learn added FunctionTransformer to its preprocessing module in version 0.17. It can be used in much the same way as David's Fisher class in the answer above, but with less flexibility. As long as the function's input and output are set up correctly, the transformer supplies the fit/transform/fit_transform methods for it, which allows the function to be used as a step in a scikit-learn pipeline.
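As a minimal sketch of what that wrapping provides: double_values below is a hypothetical function invented for illustration, not something from the question. It shows that a plain function gains the estimator methods once it is wrapped.

```python
import numpy as np
from sklearn.preprocessing import FunctionTransformer


def double_values(X):
    # Hypothetical stateless transformation: multiply every value by two.
    return X * 2


transformer = FunctionTransformer(double_values)
X = np.array([[1.0, 2.0], [3.0, 4.0]])

# fit() learns nothing from the data here; it only returns the transformer.
fitted = transformer.fit(X)

# transform() and fit_transform() both simply call the wrapped function.
out_transform = transformer.transform(X)
out_fit_transform = transformer.fit_transform(X)
```

Because fit() does not learn anything from the data, this suits stateless steps. A step that must learn parameters during fit would still need a custom class.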

For example, if the input to a pipeline is a series, the transformer would be as follows:
```
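from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer

def trans_func(input_series):
    # Placeholder body: put your own transformation here.
    output_series = input_series
    return output_series

# validate=False passes the series through unchecked; before version 0.22
# the default was validate=True, which expects a 2-D numeric array.
transformer = FunctionTransformer(trans_func, validate=False)

# tf_1k, clf_1k and train are assumed to be defined elsewhere.
sk_pipe = Pipeline([("trans", transformer), ("vect", tf_1k), ("clf", clf_1k)])
sk_pipe.fit(train.desc, train.tag)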
```
Here, vect is a TF-IDF transformer, clf is a classifier, and train is the training dataset. "train.desc" is the text series passed as input to the pipeline.
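For a version that runs end to end, here is a rough sketch with everything filled in. The toy data, the lowercase_and_strip function, and the choice of TfidfVectorizer and LogisticRegression are all assumptions made for illustration; the original tf_1k, clf_1k and train objects are not shown in this thread.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer


def lowercase_and_strip(text_series):
    # Hypothetical cleaning step: normalise case and trim whitespace.
    return text_series.str.lower().str.strip()


# Toy stand-in for the training data (made up for this example).
train = pd.DataFrame({
    "desc": [
        "  Great PRODUCT, works well ",
        "Terrible, broke quickly",
        "works great, very happy  ",
        "BROKE after a day, terrible",
    ],
    "tag": ["pos", "neg", "pos", "neg"],
})

sk_pipe = Pipeline([
    # validate=False lets the 1-D text series reach the function unchanged.
    ("trans", FunctionTransformer(lowercase_and_strip, validate=False)),
    ("vect", TfidfVectorizer()),
    ("clf", LogisticRegression()),
])
sk_pipe.fit(train.desc, train.tag)

# New text goes through the same cleaning step before vectorising.
predictions = sk_pipe.predict(pd.Series(["  WORKS great "]))
```

The cleaning function runs again at predict time, so training and new data get the same preprocessing without any extra code.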