I am reading some text files in a Java program and would like to replace some Unicode characters with ASCII approximations. These files will eventually be broken into sente
I found a pretty extensive table that maps Unicode punctuation to their closest ASCII equivalents.
Here's more info: Map Symbols & Punctuation to ASCII.