How to measure the execution time of a query on Spark

误落风尘 2020-12-01 19:16

I need to measure the execution time of a query on Apache Spark (Bluemix). What I tried:

import time

startTimeQuery = time.clock()
df = sqlContext.sql(query)
         


        
5 answers
  •  慢半拍i (OP)
    2020-12-01 19:53

    Spark itself provides fine-grained information about each stage of your Spark job.

    You can view your running job at http://IP-MasterNode:4040, or you can enable the History Server to analyze jobs at a later time.

    Refer here for more information on the History Server.
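
If you still want a rough number from the driver side, something like the sketch below might help. Two points apply to the snippet in the question: `sqlContext.sql(query)` only builds a DataFrame lazily, so nothing is executed until an action such as `collect()`, `count()` or `show()` runs, and `time.clock()` was removed in Python 3.8 (on Unix it measured CPU time rather than wall-clock time). The helper name `timed` is hypothetical, and the stand-in workload is there only so the example runs without a cluster. The figure it gives includes driver-side overhead, so it is coarser than the per-stage timings in the Spark UI.

```python
import time
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def timed(action: Callable[[], T]) -> Tuple[T, float]:
    """Run `action` once and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()  # monotonic, high-resolution wall-clock timer
    result = action()
    elapsed = time.perf_counter() - start
    return result, elapsed


# Hypothetical usage with the names from the question (`sqlContext`, `query`).
# The action (`collect()`) must be inside the timed callable, otherwise only
# the construction of the query plan is measured:
#   rows, seconds = timed(lambda: sqlContext.sql(query).collect())
#   print(f"query took {seconds:.3f} s")

# Stand-in workload so the sketch runs without a Spark cluster.
total, seconds = timed(lambda: sum(range(1_000_000)))
print(f"stand-in workload took {seconds:.3f} s")
```

Timing `count()` instead of `collect()` avoids pulling every row back to the driver, though the two actions may not execute exactly the same plan.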
