converting spark dataframe to pandas dataframe - ImportError: Pandas >= 0.19.2 must be installed
Question

I am trying to convert a Spark DataFrame to a pandas DataFrame in a Jupyter notebook on EMR, and I am getting the error below. The pandas library is installed on the master node under my user, and from the Spark shell (pyspark) on that master node I am able to convert the DataFrame to a pandas DataFrame.

The following command has been executed on all the master nodes:

    pip --no-cache-dir install pandas --user

The following works on the master node, but not from the pyspark notebook:

    import Pandas as pd

Error:

    No module named
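
Since the import works in the pyspark shell but not in the notebook, one possible explanation (an assumption, not something confirmed in this post) is that the notebook kernel runs a different Python interpreter, or a different user, than the one that received the `pip --user` install. A minimal diagnostic sketch to compare the two environments is below. The helper name `describe_python_env` is hypothetical and uses only the Python standard library.

```python
import importlib.util
import site
import sys


def describe_python_env(module_name="pandas"):
    """Collect facts about the interpreter that is running this code.

    Hypothetical helper for illustration: it does not import the module,
    it only asks the import system whether the module can be located.
    """
    spec = importlib.util.find_spec(module_name)
    return {
        # Path of the Python binary executing this code.
        "executable": sys.executable,
        "version": sys.version.split()[0],
        # Directory that `pip install --user` targets for this interpreter.
        "user_site": site.getusersitepackages(),
        # False when user site-packages are disabled (or not yet resolved).
        "user_site_enabled": bool(site.ENABLE_USER_SITE),
        "module_found": spec is not None,
        "module_location": spec.origin if spec is not None else None,
    }


if __name__ == "__main__":
    for key, value in describe_python_env().items():
        print(f"{key}: {value}")
```

Running this once in the pyspark shell on the master node and once in a notebook cell, then comparing `executable`, `user_site`, and `module_found`, should show whether both are looking at the same installation.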