I want to create my own transformer for use with the sklearn Pipeline. Hence I am creating a class that implements both fit and transform methods. The purpose of the transfo
Use "deep-copies" further on, down the pipeline and X, y remain protected
X
y
.fit() can first assign on each call deep-copy to new class-variables
.fit()
self.X_without_NaNs = X.copy() self.y_without_NaNs = y.copy()
and then reduce / transform these not to have more NaN-s than ordered by self.treshold
NaN
self.treshold