Data cleaning in Excel sheets using R

暗喜 2021-01-27 01:30

I have data in Excel sheets and I need a way to clean it. I would like to remove inconsistent values; for example, the branch name is specified as (Computer Science and Engineering, C.S.E, C.S

3 Answers
  •  攒了一身酷
    2021-01-27 01:57

    You want a way to clean your data and you specify R. Is there a reason for that (automation, remote control [console], ...)?

    If not, I would suggest OpenRefine. It is a great tool for exactly this job. It is not hosted: you can safely download it and run it against your dataset (xls/xlsx work fine). You then create a text facet and group away.

    It uses clustering algorithms to find values that are probably the same thing (and even gives you a choice of method), which is really helpful. I have cleaned a lot of data with it in no time.

    The videos on the official website are useful.
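For readers who want to see what the "group away" step in the answer above roughly does, here is a minimal Python sketch of key-collision clustering with a fingerprint-style key, which is one of the clustering methods OpenRefine offers. It is a simplified illustration, not OpenRefine's actual implementation: the function names (`fingerprint`, `cluster`, `canonical_map`) and the sample branch names are hypothetical, and choosing the most frequent spelling as the canonical one is an assumption made here for brevity.

```python
import string
import unicodedata
from collections import Counter, defaultdict


def fingerprint(value: str) -> str:
    """Build a normalized key so trivially different spellings collide."""
    # Fold accented characters to plain ASCII where possible.
    ascii_text = (
        unicodedata.normalize("NFKD", value)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    # Lowercase and strip ASCII punctuation ("C.S.E" becomes "cse").
    cleaned = ascii_text.lower().translate(
        str.maketrans("", "", string.punctuation)
    )
    # Split on whitespace, drop duplicate tokens, and sort them.
    return " ".join(sorted(set(cleaned.split())))


def cluster(values):
    """Group raw values that share the same fingerprint key."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return dict(groups)


def canonical_map(clusters):
    """Map each raw value to the most frequent spelling in its cluster
    (an assumption of this sketch; a human reviewer may choose differently)."""
    mapping = {}
    for members in clusters.values():
        canonical = Counter(members).most_common(1)[0][0]
        for member in members:
            mapping[member] = canonical
    return mapping


# Hypothetical sample data, loosely modelled on the question.
branches = [
    "Computer Science and Engineering",
    "computer science and engineering",
    "Computer Science and Engineering",
    "C.S.E",
    "CSE",
    "C.S.E",
]

clusters = cluster(branches)
mapping = canonical_map(clusters)
print(mapping)
```

Note that this kind of key only merges spellings that differ in case, punctuation, spacing, or word order. An abbreviation such as "C.S.E" and the full name "Computer Science and Engineering" still end up in two separate clusters, so the final merge needs a manual decision or a small lookup table.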
