I am trying to detect Arabic characters in a webpage\'s HTML using Notepad++ CTRL+F with regular expressions. I am entering the following as my search terms and it is return
Because Notepad++'s implementation of Regular Expressions requires that you use the
\x{NNNN}
notation to match Unicode characters.
In your example,
\x{0628}
can be used to match the ب
(bāʾ,bet,beth,vet) character.
The \u
symbol is used to match uppercase letters.
See http://sourceforge.net/apps/mediawiki/notepad-plus/index.php?title=Regular_Expressions#Ranges_or_kinds_of_characters
for an explanation of Notepad++'s regex syntax.