Hey all, I'm just getting started with Hadoop and am curious what the best way in MapReduce would be to count unique visitors if your log files looked like this...
DAT
My approach is similar to what tzaman gave, with a small twist.
Note that the first reduce does not need to go over any of the records it is presented with. You can simply examine the key and produce the output.
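To make the "just look at the key" point concrete, here is a rough sketch of a two-job pipeline, simulated in plain Python rather than with the real Hadoop API. Everything named here is an assumption for illustration: the `(date, visitor)` record layout is hypothetical (the actual log format isn't shown above), `run_job` is a made-up in-memory stand-in for map, shuffle/sort and reduce, and counting per date is just one way the final tally might be keyed.

```python
from itertools import groupby
from operator import itemgetter


def run_job(records, mapper, reducer):
    """Hypothetical in-memory stand-in for one MapReduce job."""
    # Map phase: each record may emit any number of (key, value) pairs.
    mapped = [pair for record in records for pair in mapper(record)]
    # Shuffle/sort phase: bring identical keys next to each other.
    mapped.sort(key=itemgetter(0))
    output = []
    # Reduce phase: one reducer call per distinct key.
    for key, group in groupby(mapped, key=itemgetter(0)):
        output.extend(reducer(key, (value for _, value in group)))
    return output


# Job 1: collapse duplicate (date, visitor) pairs.
def dedup_map(record):
    date, visitor = record  # assumed record layout, not the real log format
    yield (date, visitor), None


def dedup_reduce(key, values):
    # The values are never iterated: the key alone says this
    # (date, visitor) pair occurred at least once.
    yield key


# Job 2: count the distinct visitors left for each date.
def count_map(record):
    date, _visitor = record
    yield date, 1


def count_reduce(date, ones):
    yield date, sum(ones)


def unique_visitors(records):
    deduped = run_job(records, dedup_map, dedup_reduce)
    return dict(run_job(deduped, count_map, count_reduce))
```

In a real cluster the first reducer would look the same in spirit: emit the key once and ignore the value iterator, so its cost depends on the number of distinct keys rather than the number of raw log lines.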
HTH