How do I diff utf-16 files with GNU diff?
GNU diff doesn't seem to be smart enough to detect and handle UTF-16 files, which surprises me. Am I missing an obvious command-line option? Is there a good alternative? From the GNU diff documentation: Handling Multibyte and Varying-Width Characters diff, diff3 and sdiff treat each line of input as a string of unibyte characters. This can mishandle multibyte characters in some cases. For example, when asked to ignore spaces, diff does not properly ignore a multibyte space character. Also, diff currently assumes that each byte is one column wide, and this assumption is incorrect in some