MemoryError in toarray when using DictVectorizer of Scikit Learn

后端 未结 7 2107
無奈伤痛
無奈伤痛 2021-01-06 05:47

I am trying to implement the SelectKBest algorithm on my data to get the best features out of it. For this I am first preprocessing my data using DictVectorizer and the data

7条回答
  •  梦谈多话
    2021-01-06 06:36

    If your data has high cardinality because it represents text, you can try using a resource-friendlier vectorizer like HashingVectorizer

提交回复
热议问题