Reading TSV into Spark Dataframe with Scala API

后端未结

关注

 2  1364

终归单人心 2020-12-03 10:02

I have been trying to get the databricks library for reading CSVs to work. I am trying to read a TSV created by hive into a spark data frame using the scala api.

Her

2条回答

盖世英雄少女心 (楼主)

2020-12-03 10:26
With Spark 2.0+ use the built-in CSV connector to avoid third party dependancy and better performance:
```
val spark = SparkSession.builder.getOrCreate()
val segments = spark.read.option("sep", "\t").csv("/path/to/file")
```
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...