Perl and some other current regex engines support Unicode properties, such as the category, in a regex. E.g. in Perl you can use \\p{Ll} to match an arbitrary l
\\p{Ll}
The regex module (an alternative to the standard re module) supports Unicode codepoint properties with the \p{} syntax.
re
\p{}