Determine if text is in English?

情深已故 2020-12-15 23:34

I am using both NLTK and scikit-learn to do some text processing. However, my list of documents includes some that are not in English. For example, the follow

6 answers
  •  攒了一身酷
    2020-12-16 00:03

    There is a library called langdetect, a Python port of Google's language-detection library. It is available here:

    https://pypi.python.org/pypi/langdetect

    It supports 55 languages out of the box.
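
As a rough sketch of how this could be applied to the question's list of documents: the helpers below keep only the texts that a detector labels "en". The names `is_english`, `keep_english`, and the `detect_fn` parameter are illustrative assumptions, not part of langdetect. When no detector is passed, the sketch falls back to langdetect's `detect`, which requires `pip install langdetect`.

```python
from typing import Callable, Iterable, List, Optional


def is_english(text: str, detect_fn: Optional[Callable[[str], str]] = None) -> bool:
    """Return True if the detector labels `text` as English ("en")."""
    if detect_fn is None:
        # Third-party dependency, imported lazily: pip install langdetect
        from langdetect import DetectorFactory, detect
        # langdetect is non-deterministic unless a seed is set.
        DetectorFactory.seed = 0
        detect_fn = detect
    try:
        return detect_fn(text) == "en"
    except Exception:
        # langdetect raises LangDetectException for empty or feature-less
        # text; this sketch treats any detector failure as "not English".
        return False


def keep_english(docs: Iterable[str],
                 detect_fn: Optional[Callable[[str], str]] = None) -> List[str]:
    """Filter `docs` down to those detected as English, preserving order."""
    return [doc for doc in docs if is_english(doc, detect_fn)]


# Hypothetical usage with langdetect installed:
# english_docs = keep_english(["This is an English sentence.",
#                              "Ceci est une phrase en français."])
```

Detection on very short strings tends to be unreliable, so it may be worth running this on whole documents rather than on individual tokens or sentences.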
