It is described in Mahout in Action that normalization can slightly improve the accuracy. Can anyone explain the reason, thanks!
the reason behind it is that sometimes the measurements of the different variables are different in nature so the variance of the results is adjusted by normalizing. for instance in an age(x) vs weight (y) comparison for a set of children, the age can go from one to 10 and the weight can go from 10 pounds to 100. if you dont normalize the graphic will produce a two very weird long oval shapes to the right of your graph since both scales need to go form one to 100. normalizing would give both axis a 1 to 100 scale hecnce the graphic will show more meaningful clusters.