Remove “empty” character from String

后端 未结 9 1967
不思量自难忘°
不思量自难忘° 2020-12-14 10:42

I\'m using a framwork which returns malformed Strings with \"empty\" characters from time to time.

\"foobar\" for example is represented by: [,f,o,o,b,a,r]

T

9条回答
  •  隐瞒了意图╮
    2020-12-14 11:39

    Regex would be an appropriate way to sanitize the string from unwanted Unicode characters in this case.

    String sanitized = dirty.replaceAll("[\uFEFF-\uFFFF]", ""); 
    

    This will replace all char in \uFEFF-\uFFFF range with the empty string.

    The [...] construct is called a character class, e.g. [aeiou] matches one of any of the lowercase vowels, [^aeiou] matches anything but.

    You can do one of these two approaches:

    • replaceAll("[blacklist]", "")
    • replaceAll("[^whitelist]", "")

    References

    • regular-expressions.info

提交回复
热议问题