Using a broadcasted DataFrame in a PySpark UDF
Question

Is it possible to use a broadcasted DataFrame in a UDF of a PySpark SQL application? My code uses the broadcasted DataFrame inside a function registered as a PySpark UDF, as shown below.

    fact_ent_df_data = sparkSession.sparkContext.broadcast(fact_ent_df.collect())

    def generate_lookup_code(col1, col2, col3):
        fact_ent_df_count = fact_ent_df_data.select(fact_ent_df_br.TheDate.between(col1, col2), fact_ent_df_br.Ent.isin('col3')).count()
        return fact_ent_df_count

    sparkSession.udf.register("generate_lookup_code" ,
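
A note on the snippet above: `sparkContext.broadcast(fact_ent_df.collect())` broadcasts a Python list of `Row` objects rather than a DataFrame, so DataFrame methods such as `select` are not available on it inside a UDF. Below is a minimal sketch of one possible workaround that filters the broadcast list in plain Python. It assumes the `TheDate` and `Ent` columns from the question, assumes the third argument is meant to be compared by value (not the literal string `'col3'`), and assumes an integer count is the desired return type. The names `count_matches` and `register_lookup_udf` are hypothetical and introduced only for illustration.

```python
from datetime import date


def count_matches(rows, start, end, ent):
    """Count rows whose TheDate lies in [start, end] and whose Ent equals ent.

    `rows` is a plain Python sequence of mappings. pyspark Row objects also
    support row["TheDate"] lookups, and a list of Rows is what a broadcast
    of df.collect() holds.
    """
    return sum(
        1 for row in rows
        if start <= row["TheDate"] <= end and row["Ent"] == ent
    )


def register_lookup_udf(spark, fact_ent_df):
    """Hypothetical helper: broadcast the collected rows and register the UDF."""
    from pyspark.sql.types import IntegerType  # lazy import; requires pyspark

    # The broadcast wraps a list of Row objects, not a DataFrame.
    rows_bc = spark.sparkContext.broadcast(fact_ent_df.collect())

    def generate_lookup_code(col1, col2, col3):
        # .value gives access to the broadcast list on each executor.
        return count_matches(rows_bc.value, col1, col2, col3)

    spark.udf.register("generate_lookup_code", generate_lookup_code, IntegerType())
    return rows_bc


# Local check with plain dicts standing in for collected rows (made-up data).
sample_rows = [
    {"TheDate": date(2020, 1, 1), "Ent": "A"},
    {"TheDate": date(2020, 1, 15), "Ent": "A"},
    {"TheDate": date(2020, 2, 1), "Ent": "B"},
]
matches = count_matches(sample_rows, date(2020, 1, 1), date(2020, 1, 31), "A")
```

This approach only suits a lookup table small enough to collect to the driver and ship to every executor; for larger data, a join between the two DataFrames is usually the more natural fit.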