Choosing eps and minpts for DBSCAN (R)?

醉酒成梦 2020-12-23 12:06

I've been searching for an answer for this question for quite a while, so I'm hoping someone can help me. I'm using dbscan from the fpc library in R. For example, I am

6 Answers
  •  不知归路
    2020-12-23 12:35

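    A common way to choose the epsilon parameter of DBSCAN is to draw a k-distance plot of your dataset. For each data point you compute the distance to its k-th nearest neighbor (k-NN), sort those distances, and plot them; doing this for several values of k shows how the density of your data is distributed. k-NN is handy here because it is non-parametric and makes no assumption about the shape of the data. First choose minPts (which depends strongly on your data) and fix k to that value. Then read epsilon off the k-distance plot for that k: most points lie on a long, low-slope stretch of the sorted curve, and a reasonable epsilon is the k-distance at the upper end of that stretch, just before the curve bends sharply upward. Points beyond the bend have unusually distant neighbors and will tend to be labelled as noise.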
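The procedure above can be sketched in R roughly as follows. This is only an illustration under stated assumptions: the toy data, `k <- 5`, and `eps <- 0.4` are made up for the example, and in practice eps should be read off your own plot. The k-distances are computed with base R, and `fpc::dbscan` is called only if the fpc package is installed. Implementations differ on whether MinPts counts the point itself, so k = MinPts - 1 may be the closer match in some libraries.

```r
# Sketch: k-distance plot in base R, then DBSCAN via fpc (as in the question).
set.seed(42)

# Toy data (an assumption for illustration): two Gaussian blobs plus uniform noise.
x <- rbind(
  matrix(rnorm(200, mean = 0, sd = 0.3), ncol = 2),
  matrix(rnorm(200, mean = 3, sd = 0.3), ncol = 2),
  matrix(runif(40, min = -2, max = 5), ncol = 2)
)

k <- 5  # candidate MinPts (assumed); k is fixed to this value

# Distance from each point to its k-th nearest neighbor.
# Index k + 1 skips the zero distance of each point to itself.
d <- as.matrix(dist(x))
k_dist <- apply(d, 1, function(row) sort(row)[k + 1])

# Sorted k-distances: typically a long flat stretch followed by a sharp rise.
k_dist_sorted <- sort(k_dist)
plot(k_dist_sorted, type = "l",
     xlab = "points sorted by k-distance",
     ylab = paste0(k, "-NN distance"))

# eps is read off the plot near the bend; 0.4 is a guess for this toy data.
eps <- 0.4
abline(h = eps, lty = 2)

# Run DBSCAN with the chosen parameters if fpc is available.
if (requireNamespace("fpc", quietly = TRUE)) {
  fit <- fpc::dbscan(x, eps = eps, MinPts = k)
  print(table(fit$cluster))  # cluster 0 holds the noise points
}
```

If the table shows most points in cluster 0, eps is probably too small; if everything collapses into one cluster, it is probably too large. The dbscan package also provides `kNNdistplot(x, k)`, which draws the same kind of plot directly.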
