There are many ways to represent the +1 million UTF-8 characters. Take the latin capital \"A\" with macron (Ā). This is unicode code point U+0100,
Ā
U+0100
:) of course last one wouldn't work. \x sequence belongs to the double-quoted strings.
what's wrong with $char = chr(196).chr(128); ? with chr($a).chr($b) I mean.
$char = chr(196).chr(128);