I have a database containing sentences which only contain capitalized letters. The database is technical, containing medical terms, and I want to normalize it so that the ca
Search for work on truecasing: http://en.wikipedia.org/wiki/Truecasing
It would be really easy to generate your own data set if you have access to similar medical data with normal capitalization. Capitalize everything and use the mapping to the original text to train/test your algorithm.