Spark SQL - IN clause

后端 未结 1 1542
后悔当初
后悔当初 2020-12-09 12:13

I would like to add where condition for a column with Multiple values in DataFrame.

Its working for single value, for example.

df.where($\"type\".===         


        
相关标签:
1条回答
  • 2020-12-09 12:19

    the method you are looking for is isin:

    import sqlContext.implicits._
    df.where($"type".isin("type1","type2") and $"status".isin("completed","inprogress"))
    

    Typically, you want to do something like this

    val types = Seq("type1","type2")
    val statuses = Seq("completed","inprogress")
    
    df.where($"type".isin(types:_*) and $"status".isin(statuses:_*))
    
    0 讨论(0)
提交回复
热议问题