Unsupervised clustering with unknown number of clusters

后端 未结 6 584
攒了一身酷
攒了一身酷 2020-11-28 19:18

I have a large set of vectors in 3 dimensions. I need to cluster these based on Euclidean distance such that all the vectors in any particular cluster have a Euclidean dista

6条回答
  •  一整个雨季
    2020-11-28 19:39

    Use OPTICS, which works well with large data sets.

    OPTICS: Ordering Points To Identify the Clustering Structure Closely related to DBSCAN, finds core sample of high density and expands clusters from them 1. Unlike DBSCAN, keeps cluster hierarchy for a variable neighborhood radius. Better suited for usage on large datasets than the current sklearn implementation of DBSCAN

    from sklearn.cluster import OPTICS
    db = OPTICS(eps=3, min_samples=30).fit(X)
    

    Fine tune eps, min_samples as per your requirement.

提交回复
热议问题