I would like to rewrite this from R to Pyspark, any nice looking suggestions?
array <- c(1,2,3) dataset <- filter(!(column %in% array))
You can also loop the array and filter:
array = [1, 2, 3] for i in array: df = df.filter(df["column"] != i)