Efficiently count word frequencies in python

前端 未结 8 1166
走了就别回头了
走了就别回头了 2020-11-29 04:33

I\'d like to count frequencies of all words in a text file.

>>> countInFile(\'test.txt\')

should return {\'aaa\':1, \'bbb\':

8条回答
  •  失恋的感觉
    2020-11-29 05:12

    you can try with sklearn

    from sklearn.feature_extraction.text import CountVectorizer
        vectorizer = CountVectorizer()
    
        data=['i am student','the student suffers a lot']
        transformed_data =vectorizer.fit_transform(data)
        vocab= {a: b for a, b in zip(vectorizer.get_feature_names(), np.ravel(transformed_data.sum(axis=0)))}
        print (vocab)
    

提交回复
热议问题