Filtering a Pyspark DataFrame with SQL-like IN clause

前端 未结 5 1638
清酒与你
清酒与你 2020-11-27 02:54

I want to filter a Pyspark DataFrame with a SQL-like IN clause, as in

sc = SparkContext()
sqlc = SQLContext(sc)
df = sqlc.sql(\'SELECT * from my         


        
5条回答
  •  无人及你
    2020-11-27 03:59

    You can also do this for integer columns:

    df_filtered = df.filter("field1 in (1,2,3)")
    

    or this for string columns:

    df_filtered = df.filter("field1 in ('a','b','c')")
    

提交回复
热议问题