How can I best determine the correct capitalization for a word?

一整个雨季 2020-12-31 14:32

I have a database of sentences written entirely in capital letters. The database is technical and contains medical terms, and I want to normalize it so that the ca

3 answers
  •  爱一瞬间的悲伤
    2020-12-31 15:16

    Search for work on truecasing: http://en.wikipedia.org/wiki/Truecasing

    If you have access to similar medical data with normal capitalization, it is easy to generate your own data set. Uppercase everything, then use the mapping from the uppercased text back to the original text to train and test your algorithm.
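
A minimal sketch of that idea, assuming a simple most-frequent-casing (unigram) model rather than any particular published truecaser. The function names `train_truecaser` and `truecase`, the `WORD` pattern, and the three-sentence corpus are hypothetical and only for illustration; real medical text would need a much larger corpus and better tokenization.

```python
import re
from collections import Counter, defaultdict

# Assumption: a "word" is a run of ASCII letters and digits.
WORD = re.compile(r"[A-Za-z0-9]+")

def train_truecaser(cased_sentences):
    """Map each uppercased word to its most frequent original casing."""
    counts = defaultdict(Counter)
    for sentence in cased_sentences:
        words = WORD.findall(sentence)
        # Skip the first word: its capital letter comes from its position,
        # not from the word itself.
        for word in words[1:]:
            counts[word.upper()][word] += 1
    return {upper: forms.most_common(1)[0][0] for upper, forms in counts.items()}

def truecase(sentence, model):
    """Restore casing word by word; unseen words fall back to lowercase."""
    def restore(match):
        word = match.group(0)
        return model.get(word.upper(), word.lower())
    restored = WORD.sub(restore, sentence)
    # Re-capitalize the start of the sentence.
    return restored[:1].upper() + restored[1:]

# Toy corpus with normal capitalization (illustrative only).
corpus = [
    "The patient was given Tylenol and an MRI was ordered.",
    "An MRI of the knee showed no damage.",
    "She took Tylenol after the MRI scan.",
]
model = train_truecaser(corpus)
restored = truecase("THE MRI WAS ORDERED AFTER TYLENOL.", model)
print(restored)
```

To evaluate, hold out part of the normally cased corpus, uppercase it, run `truecase` on it, and compare the result with the original text word by word.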
