I would like to rewrite this from R to PySpark. Any nice-looking suggestions?
array <- c(1, 2, 3)
dataset <- filter(dataset, !(column %in% array))
The * (argument unpacking) is not needed, because isin() accepts a list directly. So:
values = [1, 2, 3]  # renamed from "list" to avoid shadowing the Python built-in
dataframe.filter(~dataframe.column.isin(values))