Delete duplicate rows in two columns simultaneously [duplicate]

ぃ、小莉子 提交于 2019-11-26 17:16:17

问题


I would like to delete duplicate rows based in two collumns, instead just one.

My input df:

RAW.PVAL  GR     allrl  Bak
0.05      fr     EN1    B12
0.05      fg     EN1    B11
0.45      fr     EN2    B10
0.35      fg     EN2    B066

My output:

RAW.PVAL  GR  allrl   Bak
0.05      fr   EN1    B12
0.45      fg   EN2    B10
0.35      fg   EN2    B066

I had tried df<- subset(df, !duplicated(allrl, RAW.PVAL)), but I do not work to delete rows with this two columns simultaneously duplicated.

Thank you!


回答1:


If you want to use subset, you could try:

  subset(df, !duplicated(subset(df, select=c(allrl, RAW.PVAL))))
 # RAW.PVAL GR allrl  Bak
 #1     0.05 fr   EN1  B12
 #3     0.45 fr   EN2  B10
 #4     0.35 fg   EN2 B066

But, I think @thelatemail's approach would be better

  df[!duplicated(df[c("RAW.PVAL","allrl")]),]



回答2:


Use unique() to remove the duplicate values.



来源:https://stackoverflow.com/questions/25301476/delete-duplicate-rows-in-two-columns-simultaneously

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!