I am parsing very large JSON files (20,000+ lines), and I am using a multiprocessing pool with the map function to help speed things up. However, I keep hearing about using y