DataFrameReader.csv(path: String) option for skipping blank lines

好久不见. Submitted on 2019-12-12 01:48:14

Question


Does

org.apache.spark.sql.DataFrameReader.csv(path: String)

have an option for skipping blank lines? In particular, a blank line as the last line?


Answer 1:


You could try setting mode to "DROPMALFORMED" as in:

val df = sqlContext.read.format("com.databricks.spark.csv").option("mode", "DROPMALFORMED")...

In Python:

df = sqlContext.read.format('com.databricks.spark.csv').options(mode = "DROPMALFORMED")...

According to the documentation, this mode:

"...drops lines which have fewer or more tokens than expected."
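
To make the quoted rule concrete, here is a minimal plain-Python sketch of what "fewer or more tokens than expected" means for blank lines. It is only an illustration using the standard csv module, not Spark's or spark-csv's actual implementation; the helper name drop_malformed, its expected_tokens parameter, and the sample data are hypothetical.

```python
import csv
import io


def drop_malformed(text, expected_tokens):
    """Keep only rows that have exactly `expected_tokens` fields.

    Hypothetical helper: mimics the documented rule, not Spark's parser.
    """
    reader = csv.reader(io.StringIO(text))
    # csv.reader yields an empty list for a blank line, so a blank line
    # has 0 tokens and is filtered out along with over-long rows.
    return [row for row in reader if len(row) == expected_tokens]


# Sample input with a blank line in the middle, a row with an extra
# token, and a blank line at the very end.
sample = "id,name\n1,alice\n\n2,bob,extra\n3,carol\n\n"
rows = drop_malformed(sample, expected_tokens=2)
print(rows)
```

Under this rule the blank lines (including the trailing one) and the three-token row are dropped, while the two-token rows are kept. Whether the real reader treats a trailing blank line exactly this way should be checked against the Spark version in use.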



Source: https://stackoverflow.com/questions/43476254/dataframereadercsvpath-string-option-for-skipping-blank-lines
