Language detection with data in PostgreSQL

后端 未结 6 577
我在风中等你
我在风中等你 2020-12-31 15:17

I have a table in PostgreSQL where a column is a text. I need a library or tool that can identify the language of each text for a test purpose.

There is no need for

6条回答
  •  情书的邮戳
    2020-12-31 15:42

    Try these:

    • http://code.google.com/p/language-detection/ (Java)
    • http://code.google.com/p/chromium-compact-language-detector/ (C++/Python)

    This blog post shares some tests to compare the 2 libraries (along with a 3rd - the Language Identification module of Apache Tika, which really is a complete toolkit for Text Analysis).

提交回复
热议问题