So I want to cluster the records in this table to find which records are \'similar\' (i.e. have enough in common). An example of the table is as follows: