Proximity Matrix in sklearn.ensemble.RandomForestClassifier

盖世英雄少女心 2020-12-28 20:01

I'm trying to perform clustering in Python using Random Forests. In the R implementation of Random Forests, there is a flag you can set to get the proximity matrix. I can'

3 answers

温柔的废话 (original poster)

2020-12-28 20:52

We don't implement a proximity matrix in scikit-learn (yet).

However, this could be done by relying on the apply function provided in our implementation of decision trees. That is, for all pairs of samples in your dataset, iterate over the decision trees in the forest (through forest.estimators_) and count the number of times they fall in the same leaf, i.e., the number of times apply gives the same node id for both samples in the pair.
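
The counting described above can be sketched as follows. This is only an illustration, not an official scikit-learn feature: the helper name proximity_matrix, its normalize parameter, and the iris example data are assumptions made for this sketch. It calls apply on the fitted forest, which returns the leaf index of every sample in every tree, instead of looping over forest.estimators_ by hand.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier


def proximity_matrix(forest, X, normalize=True):
    """Count, for every pair of samples, the trees in which both share a leaf.

    Hypothetical helper written for this example; not part of scikit-learn.
    """
    # leaves[i, t] is the index of the leaf that sample i reaches in tree t.
    leaves = forest.apply(X)
    n_samples, n_trees = leaves.shape

    prox = np.zeros((n_samples, n_samples), dtype=float)
    for t in range(n_trees):
        col = leaves[:, t]
        # Boolean (n_samples, n_samples) matrix: True where both samples
        # of a pair land in the same leaf of tree t.
        prox += col[:, None] == col[None, :]

    if normalize:
        # Turn counts into the fraction of trees in which a pair co-occurs.
        prox /= n_trees
    return prox


# Example data and forest settings are assumptions for the demonstration.
X, y = load_iris(return_X_y=True)
forest = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)
prox = proximity_matrix(forest, X)
```

For clustering, one option is to pass 1 - prox as a precomputed distance to a clustering routine that accepts one. Note that the matrix needs memory quadratic in the number of samples, so this sketch is only practical for small to moderate datasets.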

Hope this helps.
