Binary decision tree model when one of the labels is extremely rare
Question: I want to build a decision tree that predicts one of two labels, "YES" or "NO". The dataset I am working with has 99% "YES" answers and only 1% "NO" answers. When I ran the model, it scored up to 97% accuracy. Is this a valid model, or are there considerations to take into account when working with such imbalanced classes? I am afraid that, because of the large amount of "YES" data, the model is scoring well simply by answering "YES" to everything. The "NO"s are very