Language detection with data in PostgreSQL

我在风中等你 2020-12-31 15:17

I have a table in PostgreSQL with a text column. I need a library or tool that can identify the language of each text, for testing purposes.

There is no need for

6 answers
  •  耶瑟儿~
    2020-12-31 15:51

    I found a library called TextCat, which is available under the LGPL. I can't vouch for the quality of its identification, but it has an online demo form, so you can throw some text at it before deciding whether it's worth downloading.

    It's also written in Perl, so if you do want to use it, the approach in filiprem's answer would be a good starting point.
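To give a feel for what this kind of tool does, here is a minimal Python sketch of character n-gram profile matching (the Cavnar and Trenkle "out-of-place" measure, which is the technique TextCat is generally described as implementing). This is not TextCat's code or API. The function names, the `SAMPLES` sentences, and the `top_k` and `max_n` values are all made up for illustration, and profiles trained on one sentence per language are far too small for real use.

```python
from collections import Counter

def ngram_profile(text, max_n=3, top_k=300):
    """Return the text's character n-grams (length 1..max_n), most frequent first."""
    # Normalise case and whitespace, and pad so word boundaries form n-grams too.
    padded = " " + " ".join(text.lower().split()) + " "
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(padded) - n + 1):
            counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ranks are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]

def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from lang_profile get a fixed penalty."""
    rank = {gram: i for i, gram in enumerate(lang_profile)}
    penalty = len(lang_profile)
    return sum(
        abs(i - rank[gram]) if gram in rank else penalty
        for i, gram in enumerate(doc_profile)
    )

def detect_language(text, profiles):
    """Return the language whose profile is closest to the text's profile."""
    doc_profile = ngram_profile(text)
    return min(profiles, key=lambda lang: out_of_place(doc_profile, profiles[lang]))

# Hypothetical, tiny training samples (assumption: real profiles need far more text).
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and this is a sentence in the english language",
    "de": "der schnelle braune fuchs springt über den faulen hund und das ist ein satz in der deutschen sprache",
    "fr": "le renard brun rapide saute par dessus le chien paresseux et ceci est une phrase en langue française",
}
PROFILES = {lang: ngram_profile(sample) for lang, sample in SAMPLES.items()}

if __name__ == "__main__":
    # In practice the texts would come from the PostgreSQL column, e.g. via a driver.
    for text in ["this is the language of the text", "das ist die sprache"]:
        print(text, "->", detect_language(text, PROFILES))
```

For the use case in the question, one option would be to fetch the text column with any PostgreSQL driver and call `detect_language` on each row; another is to wrap similar logic in a procedural-language function so it runs inside the database. Either way, short texts are the hard case for this method, so results on single sentences should be checked by hand.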
