What is the correct way to access the log4j logger of Spark using pyspark on an executor?
It's easy to do so in the driver, but I cannot seem to understand how to access it on the executors.
I have yet another approach to solving the logging issue in PySpark. The idea is as follows: use a remote log-management service (for example, Loggly) and configure Python's standard logging module on every node, so that both the driver and the executors ship their log records to that service over the network.
This is a good approach if you are already using cloud services, as many of them also offer log collection/management services.
I have a simple wordcount example on GitHub that demonstrates this approach: https://github.com/chhantyal/wordcount
The Spark app sends logs to Loggly using the standard logging module, from the driver (master node) as well as from the executors (worker nodes).
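The repo has the full details, but here is a minimal sketch of the same pattern. It assumes a hypothetical HTTP log-ingestion endpoint (`LOG_HOST` and `LOG_PATH` are placeholders you would replace with your service's values, e.g. your Loggly token URL) and uses only the standard library's `logging.handlers.HTTPHandler`, so no extra dependencies are needed on the workers:

```python
import logging
import logging.handlers

from pyspark.sql import SparkSession

# Hypothetical ingestion endpoint -- replace with your log service's host/path.
LOG_HOST = "logs.example.com:443"
LOG_PATH = "/inputs/YOUR-TOKEN/"


def get_logger(name="wordcount"):
    """Return a logger that ships records to a remote HTTP endpoint.

    Safe to call on both the driver and the executors, since it relies
    only on Python's standard logging module (no Spark/JVM objects,
    which cannot be pickled to the workers).
    """
    logger = logging.getLogger(name)
    if not logger.handlers:  # avoid adding duplicate handlers on reuse
        handler = logging.handlers.HTTPHandler(
            LOG_HOST, LOG_PATH, method="POST", secure=True)
        logger.addHandler(handler)
        logger.setLevel(logging.INFO)
    return logger


def count_words(lines):
    # Runs on the executors; each worker configures its own logger.
    logger = get_logger()
    for line in lines:
        logger.info("processing a line on an executor")
        for word in line.split():
            yield (word, 1)


if __name__ == "__main__":
    spark = SparkSession.builder.appName("wordcount").getOrCreate()
    get_logger().info("job started on the driver")  # driver-side logging
    counts = (spark.sparkContext.textFile("input.txt")
              .mapPartitions(count_words)
              .reduceByKey(lambda a, b: a + b))
    print(counts.take(10))
    spark.stop()
```

The key point is that the logger is created inside the function that runs on the executors, rather than captured from the driver: driver-side logger objects cannot be serialized to the workers, but a plain configuration function like `get_logger` can.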