Choosing eps and minpts for DBSCAN (R)?

后端 未结 6 2034
醉酒成梦
醉酒成梦 2020-12-23 12:06

I\'ve been searching for an answer for this question for quite a while, so I\'m hoping someone can help me. I\'m using dbscan from the fpc library in R. For example, I am

6条回答
  •  执念已碎
    2020-12-23 12:32

    For details on choosing parameters, see the paper below on p. 11:

    Schubert, E., Sander, J., Ester, M., Kriegel, H. P., & Xu, X. (2017). DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Transactions on Database Systems (TODS), 42(3), 19.

    • For two-dimensional data: use default value of minPts=4 (Ester et al., 1996)
    • For more than 2 dimensions: minPts=2*dim (Sander et al., 1998)

    Once you know which MinPts to choose, you can determine Epsilon:

    • Plot the k-distances with k=minPts (Ester et al., 1996)
    • Find the 'elbow' in the graph--> The k-distance value is your Epsilon value.

提交回复
热议问题