发表新帖

发表新帖

Am I passing the string correctly to the python library?

后端未结

关注

 3  1110

终归单人心 2021-01-29 07:17

I\'m using a python library called Guess Language: http://pypi.python.org/pypi/guess-language/0.1

\"justwords\" is a string with unicode text. I stick it in the package,

3条回答

长发绾君心 (楼主)

2021-01-29 07:50

It looks like you should be able to pass your unicode as-is. guessLanguage decodes an input that is str as utf-8. So your .encode('utf-8') is safe but unnecessary.

I skimmed the source code and assumed it relies exclusively on the data in its "trigrams" directory for language detection, and it would not handle Japanese because there is no "ja" subdirectory in there. That is not correct, as pointed out by John Machin. So I have to assume your input is not what you think it is (which is hard to debug since it's not showing up correctly in your question).

0 讨论(0)

查看其它3个回答
发布评论:

提交评论
- 加载中...

热议问题