I am a newbie in text mining, here is my situation. Suppose i have a list of words [\'car\', \'dog\', \'puppy\', \'vehicle\'], i would like to cluster words into k groups, I
Adding on to what's already been said regarding similarity scores, finding k in clustering applications generally is aided by scree plots (also known as an "elbow curve"). In these plots, you'll usually have some measure of dispersion between clusters on the y-axis, and the number of clusters on the x-axis. Finding the minimum (second derivative) of the curve in the scree plot gives you a more objective measure of cluster "uniqueness."