Operations on a very large csv with pandas
问题 I have been using pandas on csv files to get some values out of them. My data looks like this: "A",23.495,41.995,"this is a sentence with some words" "B",52.243,0.118,"More text but contains WORD1" "A",119.142,-58.289,"Also contains WORD1" "B",423.2535,292.3958,"Doesn't contain anything of interest" "C",12.413,18.494,"This string contains WORD2" I have a simple script to read the csv and create the frequencies of WORD by group so the output is like: group freqW1 freqW2 A 1 0 B 1 0 C 0 1 Then