I need to identify what natural language my input belongs to. The goal is to distinguish between Arabic and English words in a mixed input, where the inpu
The Unicode Script property is probably more useful. In Java (7 and later), it can be looked up with the java.lang.Character.UnicodeScript enum, whose of() method takes a code point:
Character.UnicodeScript script = Character.UnicodeScript.of(c);
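
To apply this to whole words rather than single characters, one option is to count the script of each letter and take the most frequent one. The following is only a minimal sketch under some assumptions: the class name ScriptDetector, the helper dominantScript, the whitespace-based word splitting, and the sample input are all made up for illustration and do not come from the question. Keep in mind that a script is not the same thing as a language: ARABIC also covers Persian and Urdu text, and LATIN covers far more than English, so this only separates the two scripts.

```java
import java.lang.Character.UnicodeScript;
import java.util.EnumMap;
import java.util.Map;

// Hypothetical helper class, for illustration only.
class ScriptDetector {

    /**
     * Returns the most frequent Unicode script among the letters of a word,
     * or COMMON if the word contains no letters (digits, punctuation, ...).
     */
    public static UnicodeScript dominantScript(String word) {
        Map<UnicodeScript, Integer> counts = new EnumMap<>(UnicodeScript.class);

        // Iterate by code point (not char) so supplementary characters work.
        word.codePoints()
            .filter(Character::isLetter)
            .forEach(cp -> counts.merge(UnicodeScript.of(cp), 1, Integer::sum));

        UnicodeScript best = UnicodeScript.COMMON;
        int bestCount = 0;
        for (Map.Entry<UnicodeScript, Integer> e : counts.entrySet()) {
            if (e.getValue() > bestCount) {
                best = e.getKey();
                bestCount = e.getValue();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // The middle word is Arabic, written with \\u escapes so the source
        // file compiles regardless of its encoding.
        String input = "hello \u0645\u0631\u062D\u0628\u0627 world";

        // Assumption: words are separated by whitespace.
        for (String word : input.split("\\s+")) {
            System.out.println(word + " -> " + dominantScript(word));
        }
    }
}
```

A word would then be treated as Arabic when dominantScript returns UnicodeScript.ARABIC and as English when it returns UnicodeScript.LATIN. Counting letters instead of checking only the first character makes the result a bit more robust for words with leading punctuation or digits.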