I have trouble understanding the difference (if there is one) between roc_auc_score()
and auc()
in scikit-learn.
Im tying to predict a bina
predict
returns only one class or the other. Then you compute a ROC with the results of predict
on a classifier, there are only three thresholds (trial all one class, trivial all the other class, and in between). Your ROC curve looks like this:
..............................
|
|
|
......|
|
|
|
|
|
|
|
|
|
|
|
Meanwhile, predict_proba()
returns an entire range of probabilities, so now you can put more than three thresholds on your data.
.......................
|
|
|
...|
|
|
.....|
|
|
....|
.|
|
|
|
|
Hence different areas.