Trouble with strings with Unicode characters

前端 未结 2 886
日久生厌
日久生厌 2020-12-20 12:35

I have a very large dataset (70k rows, 2600 columns, CSV format) that I have created by web scraping. Unfortunately, doing the pre-processing, processing etc. at some point

2条回答
  •  佛祖请我去吃肉
    2020-12-20 13:13

    Not sure it will work for you but for the same symptoms i did convert the strings to ascii:

    x <- iconv(x, "", "ASCII", "byte")
    

    For non ascii chars, the indication is "" with the hex code of the byte.

    You can then gsub the hex codes to the values that suit you.

提交回复
热议问题