I get user input including non-ASCII characters and non-printable characters, such as
\\xc2d \\xa0 \\xe7 \\xc3\\ufffdd \\xc3\\ufffdd \\xc2\\xa0 \\xc3\\xa7
You can use java.text.normalizer