Data imputation with fancyimpute and pandas
Question

I have a large pandas DataFrame, df, with quite a few missing values. Dropping rows or columns is not an option. Imputing medians, means, or the most frequent values is not an option either, so imputation with pandas and/or scikit-learn unfortunately doesn't do the trick. I came across what seems to be a neat package called fancyimpute (you can find it here), but I have some problems with it. Here is what I do:

    # the necessary imports
    import pandas as pd
    import numpy as np
    from fancyimpute import KNN