The Effect of Specifying Training Data as New Data when Making Random Forest Predictions in R
问题 While using the predict function in R to get the predictions from a Random Forest model, I misspecified the training data as newdata as follows: RF1pred <- predict(RF1, newdata=TrainS1, type = "class") Used like this, I get extremely high accuracy and AUC, which I am sure is not right, but I couldn't find a good explanation for it. This thread is the closest I got, but I can's say I fully understand the explanation there. If someone could elaborate, I will be grateful. Thank you! EDIT: