I have trouble understanding the difference (if there is one) between roc_auc_score() and auc() in scikit-learn.
Im tying to predict a bina
predict returns only one class or the other. Then you compute a ROC with the results of predict on a classifier, there are only three thresholds (trial all one class, trivial all the other class, and in between). Your ROC curve looks like this:
..............................
|
|
|
......|
|
|
|
|
|
|
|
|
|
|
|
Meanwhile, predict_proba() returns an entire range of probabilities, so now you can put more than three thresholds on your data.
.......................
|
|
|
...|
|
|
.....|
|
|
....|
.|
|
|
|
|
Hence different areas.