A better way to load MongoDB data to a DataFrame using Pandas and PyMongo?

前端未结

关注

 4  1173

故里飘歌 2020-12-29 13:59

I have a 0.7 GB MongoDB database containing tweets that I\'m trying to load into a dataframe. However, I get an error.

MemoryError:

4条回答

半阙折子戏 (楼主)

2020-12-29 14:24

The from_records classmethod is probably the best way to do it:

from pandas import pd
import pymongo

client = pymongo.MongoClient()
data = db.mydb.mycollection.find() # or db.mydb.mycollection.aggregate(pipeline)

df = pd.DataFrame.from_records(data)

0 讨论(0)

查看其它4个回答