How to perform a merge of (too) large dataframes?
问题 I'm trying to merge couple of dataframes from HomeCredit Kaggle competion according to the data schema. I did following: train = pd.read_csv('~/Documents/HomeCredit/application_train.csv') bureau = pd.read_csv('~/Documents/HomeCredit/bureau.csv') bureau_balance = pd.read_csv('~/Documents/HomeCredit/bureau_balance.csv') train = train.merge(bureau,how='outer',left_on=['SK_ID_CURR'],right_on=['SK_ID_CURR']) train = train.merge(bureau_balance,how='inner',left_on=['SK_ID_BUREAU'],right_on=['SK_ID