Converting CSV to ORC with Spark
Question

I've seen this blog post by Hortonworks about ORC support in Spark 1.2 through data sources. It covers version 1.2, and it addresses creating an ORC file from objects, not converting CSV to ORC. I have also seen ways, as intended, to do these conversions in Hive. Could someone please provide a simple example of how to load a plain CSV file in Spark 1.6+, save it as ORC, and then load it as a DataFrame in Spark?

Answer 1:

I'm going to omit the CSV reading part because