A better way to load MongoDB data to a DataFrame using Pandas and PyMongo?

前端 未结 4 1174
故里飘歌
故里飘歌 2020-12-29 13:59

I have a 0.7 GB MongoDB database containing tweets that I\'m trying to load into a dataframe. However, I get an error.

MemoryError:    

My

4条回答
  •  半阙折子戏
    2020-12-29 14:17

    I've modified my code to the following:

    cursor = tweets.find(fields=['id'])
    tweet_fields = ['id']
    result = DataFrame(list(cursor), columns = tweet_fields)
    

    By adding the fields parameter in the find() function I restricted the output. Which means that I'm not loading every field but only the selected fields into the DataFrame. Everything works fine now.

提交回复
热议问题