来源:https://stackoverflow.com/questions/30738924/detecting-corrupt-characters-in-utf-8-encoded-text-file 标签 regex encoding awk utf-8 scripting