How to split a huge csv file based on content of first column?

后端 未结 7 2096
一个人的身影
一个人的身影 2020-12-01 17:01
  • I have a 250MB+ huge csv file to upload
  • file format is group_id, application_id, reading and data could look like


        
7条回答
  •  忘掉有多难
    2020-12-01 17:33

    If the file is already sorted by group_id, you can do something like:

    import csv
    from itertools import groupby
    
    for key, rows in groupby(csv.reader(open("foo.csv")),
                             lambda row: row[0]):
        with open("%s.txt" % key, "w") as output:
            for row in rows:
                output.write(",".join(row) + "\n")
    

提交回复
热议问题