I am trying to implement the SelectKBest algorithm on my data to get the best features out of it. For this I am first preprocessing my data using DictVectorizer and the data
If your data has high cardinality because it represents text, you can try using a resource-friendlier vectorizer like HashingVectorizer