My context is bioinformatics, next-generation sequencing in particular, but the problem is generic, so I will use a log file as an example.
The file is very large (G
To process lazy data multiple times in constant space, you can do three things:
par to do n parallel traversals at the same time

Those are your options. The last one is the coolest :)
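
For illustration, here is a minimal sketch of the par option, assuming the language is Haskell and the par in question is GHC's spark primitive (exported from GHC.Conc in base, and from Control.Parallel in the parallel package). The function name sumAndLength and the integer list standing in for the lazily read file are made up for the example. It runs two strict folds over the same lazy list, one of them in a spark.

```haskell
import Data.List (foldl')
import GHC.Conc (par, pseq)

-- Hypothetical example: two independent traversals of one lazy list.
-- `par` sparks the sum so another core may evaluate it while the
-- current thread computes the length; `pseq` forces the length
-- before the pair is returned.
sumAndLength :: [Int] -> (Int, Int)
sumAndLength xs = s `par` (n `pseq` (s, n))
  where
    s = foldl' (+) 0 xs                  -- traversal 1: strict sum
    n = foldl' (\acc _ -> acc + 1) 0 xs  -- traversal 2: strict length

main :: IO ()
main = print (sumAndLength [1 .. 1000000])
```

The result is the same whether or not the spark actually runs in parallel, but the space behaviour is not. The consumed prefix of the list can only be garbage collected if both traversals advance at roughly the same pace, which in practice means compiling with -threaded and running with +RTS -N2 or more. If the spark is never picked up, the first traversal to finish forces the whole list while the other still holds a reference to its head, so the list is retained in memory.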