How to fix “Task attempt_201104251139_0295_r_000006_0 failed to report status for 600 seconds.”
问题 I wrote a mapreduce job to extract some info from a dataset. The dataset is users' rating about movies. The number of users is about 250K and the number of movies is about 300k. The output of map is <user, <movie, rating>*> and <movie,<user,rating>*> . In the reducer, I will process these pairs. But when I run the job, the mapper completes as expected, but reducer always complain that Task attempt_* failed to report status for 600 seconds. I know this is due to failed to update status, so I