Converting JavaRDD to DataFrame in Spark java

后端 未结 4 791
暗喜
暗喜 2020-12-19 01:27

I am trying to process the LogFile. first i read the log file and split these file as per my requirement and saved each column into separate JavaRDD. Now i need to convert t

4条回答
  •  天命终不由人
    2020-12-19 02:19

    Just flatmap your data according to 7 column table and use code snippet below

    String[] columns = new String[7] {"clumn1","column2","column3","column4","column5","column6","column7"};
    List tableColumns = Arrays.asList(columns);
    
    StrucType schema = createSchema(tableColumns);
    
        public StructType createSchema(List tableColumns){
    
            List fields  = new ArrayList();
            for(String column : tableColumns){         
    
                    fields.add(DataTypes.createStructField(column, DataTypes.StringType, true));            
    
            }
            return DataTypes.createStructType(fields);
        }
    
    sqlContext.createDataFrame(urlRDD, schema);
    

提交回复
热议问题