Spark Scala list folders in directory

前端 未结 9 2454
北恋
北恋 2020-12-05 09:41

I want to list all folders within a hdfs directory using Scala/Spark. In Hadoop I can do this by using the command: hadoop fs -ls hdfs://sandbox.hortonworks.com/demo/<

9条回答
  •  青春惊慌失措
    2020-12-05 10:32

    We are using hadoop 1.4 and it doesn't have listFiles method so we use listStatus to get directories. It doesn't have recursive option but it is easy to manage recursive lookup.

    val fs = FileSystem.get(new Configuration())
    val status = fs.listStatus(new Path(YOUR_HDFS_PATH))
    status.foreach(x=> println(x.getPath))
    

提交回复
热议问题