In the field of Data Mining, is there a specific sub-discipline called 'Similarity'? If yes, what does it deal with? Any examples, links, or references would be helpful.
Similarity is a concept that is used in several data mining tasks, such as clustering and classification. Depending on what kind of data you have, you may use different similarity measures, such as cosine similarity for text documents, Euclidean distance, etc.
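
To make the two measures mentioned above concrete, here is a minimal sketch in plain Python. The function names and the toy term-count vectors (doc_a, doc_b, doc_c) are made up for illustration and are not from any particular library; real projects would typically use an existing implementation instead.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # Convention chosen for this sketch: an all-zero vector is similar to nothing.
        return 0.0
    return dot / (norm_a * norm_b)

def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors (0.0 = identical)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Hypothetical term-count vectors for three tiny documents.
doc_a = [2, 1, 0]
doc_b = [4, 2, 0]   # same word proportions as doc_a, but twice as long
doc_c = [0, 0, 3]   # shares no words with doc_a

print(cosine_similarity(doc_a, doc_b), euclidean_distance(doc_a, doc_b))
print(cosine_similarity(doc_a, doc_c), euclidean_distance(doc_a, doc_c))
```

Note that doc_a and doc_b point in the same direction, so their cosine similarity is about 1 even though their Euclidean distance is not zero. That insensitivity to document length is one reason cosine similarity is often preferred for text.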