PySpark DataFrame operator “IS NOT IN”

轮回少年 2020-12-08 14:15

I would like to rewrite this from R to PySpark. Any nice-looking suggestions?

library(dplyr)
array <- c(1, 2, 3)
dataset <- filter(dataset, !(column %in% array))
7 Answers
  •  Happy的楠姐
    2020-12-08 14:49

    Slightly different syntax, applied to a set of date strings:

    # Dates to exclude (isin accepts a list or a set)
    toGetDates = {'2017-11-09', '2017-11-11', '2017-11-12'}
    # ~ negates the boolean Column, giving "IS NOT IN"
    df = df.filter(~df['DATE'].isin(toGetDates))
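
The same negated `isin` filter can be wrapped in a small reusable helper. This is only a sketch: `exclude_values` is a hypothetical name, not part of PySpark, and it assumes `df` is an existing PySpark DataFrame that contains the named column.

```python
def exclude_values(df, column, values):
    """Return the rows of `df` whose `column` is NOT in `values`.

    Hypothetical helper, not a PySpark API. `df` is assumed to be a
    PySpark DataFrame and `values` any iterable of literals.
    """
    # isin() builds a boolean Column; ~ negates it ("IS NOT IN").
    # Rows where the column is null are dropped as well, because
    # NOT (NULL IN (...)) evaluates to NULL under SQL semantics.
    return df.filter(~df[column].isin(list(values)))


# Usage for the question's example, assuming `dataset` has a column
# literally named "column":
# dataset = exclude_values(dataset, "column", [1, 2, 3])
```

If rows with a null in the column should be kept, the condition would need an explicit `df[column].isNull() | ...` term in front of the negated `isin`.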
    
