How do I store a TfidfVectorizer for future use in scikit-learn?

后端 未结 3 976
不思量自难忘°
不思量自难忘° 2020-12-08 03:21

I have a TfidfVectorizer that vectorizes collection of articles followed by feature selection.

vectroizer = TfidfVectorizer()
X_train = vectroiz         


        
3条回答
  •  情书的邮戳
    2020-12-08 03:38

    Here is my answer using joblib:

    joblib.dump(vectorizer, 'vectroizer.pkl')
    joblib.dump(selector, 'selector.pkl')
    

    Later, I can load it and ready to go:

    vectorizer = joblib.load('vectorizer.pkl')
    selector = joblib.load('selector.pkl')
    
    test = selector.trasnform(vectorizer.transform(['this is test']))
    

提交回复
热议问题