(null) entry in command string exception in saveAsTextFile() on Pyspark


梦谈多话 2020-11-27 19:37

I am working in PySpark in a Jupyter notebook (Python 2.7) on Windows 7. I have an RDD of type pyspark.rdd.PipelinedRDD called idSums. When attempt

3 answers

时光取名叫无心 (original poster)

2020-11-27 20:25
You are missing winutils.exe, a Hadoop binary. Depending on whether your system is 64-bit or 32-bit, download the matching winutils.exe file and set your Hadoop home to point to it.

1st way:
1. Download the file.
2. Create a hadoop folder on your system, e.g. on C:
3. Create a bin folder in the hadoop directory, e.g. C:\hadoop\bin
4. Paste winutils.exe into bin, e.g. C:\hadoop\bin\winutils.exe
5. Under User Variables in System Properties -> Advanced System Settings, create a new variable with Name: HADOOP_HOME and Path: C:\hadoop\
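
After following the steps above, it can help to confirm that the variable and the file are where they are expected to be. The following is only a minimal sketch: the helper name `check_hadoop_home` is made up for illustration, and it assumes nothing beyond the layout described in the steps (a `bin` folder under HADOOP_HOME containing `winutils.exe`). Open a new console or restart the notebook first, since a newly created user variable is not visible to processes that were already running.

```python
import os


def check_hadoop_home(env=None):
    """Return a list of problems with the HADOOP_HOME setup (empty if none found).

    `env` is a mapping of environment variables; it defaults to os.environ.
    This is a hypothetical helper, not part of PySpark or Hadoop.
    """
    env = os.environ if env is None else env
    problems = []
    hadoop_home = env.get("HADOOP_HOME")
    if not hadoop_home:
        problems.append("HADOOP_HOME is not set")
        return problems
    # Step 4 above places the binary in <HADOOP_HOME>\bin\winutils.exe.
    winutils = os.path.join(hadoop_home, "bin", "winutils.exe")
    if not os.path.isfile(winutils):
        problems.append("winutils.exe not found at " + winutils)
    return problems


if __name__ == "__main__":
    for line in check_hadoop_home() or ["HADOOP_HOME and winutils.exe look fine"]:
        print(line)
```

An empty list means both checks passed; it does not prove that the downloaded binary matches your system's 32-bit or 64-bit architecture.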

2nd way:

You can set the Hadoop home directly in your Java program like this:
```
// Backslashes must be escaped inside a Java string literal.
System.setProperty("hadoop.home.dir", "C:\\hadoop");
```
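
Since the question concerns a PySpark notebook rather than a Java program, a rough Python counterpart is to set the HADOOP_HOME environment variable from within the notebook. This is a sketch under assumptions, not the answerer's method: it assumes the SparkContext (and therefore the JVM) has not been created yet, so that the JVM process PySpark launches inherits the variable. The helper name `set_hadoop_home` is hypothetical, and `C:\hadoop` is the example location from the steps above.

```python
import os

# Example location from the steps above; adjust it to your machine.
# The raw string keeps the backslash from being read as an escape.
HADOOP_HOME = r"C:\hadoop"


def set_hadoop_home(path=HADOOP_HOME):
    """Set HADOOP_HOME for this process and any child process it starts."""
    os.environ["HADOOP_HOME"] = path
    return os.environ["HADOOP_HOME"]


set_hadoop_home()

# Create the SparkContext only after this point (requires pyspark; not run here):
# from pyspark import SparkContext
# sc = SparkContext("local", "demo")
```

If a SparkContext already exists in the notebook, restart the kernel before running this, because an already running JVM will not see the change.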