How can I best determine the correct capitalization for a word?

前端 未结 3 473
一整个雨季
一整个雨季 2020-12-31 14:32

I have a database containing sentences which only contain capitalized letters. The database is technical, containing medical terms, and I want to normalize it so that the ca

3条回答
  •  死守一世寂寞
    2020-12-31 15:27

    Easiest way to do this is to use a spell correction algorithm based on ngrams.

    You can use, for example LingPipe SpellChecker. You can find source code for predicting spaces in word, similar to what can be done for predicting case.

提交回复
热议问题