xgboost: The meaning of the base_score parameter

Submitted by 不问归期 on 2020-12-01 02:31:51

Question


In the documentation of xgboost I read:

base_score [default=0.5] : the initial prediction score of all instances, global bias

What does this phrase mean? Is the base score the prior probability of the event of interest in the dataset? That is, in a dataset of 1,000 observations with 300 positives and 700 negatives, would the base score be 0.3?

If not, what would it be?

Your advice will be appreciated.


Answer 1:


I think your understanding is correct: in your example the base score could be set to 0.3, or you can simply leave it at the default of 0.5. For highly imbalanced data, initializing it to a more meaningful value, such as the positive-class rate, can improve the learning process. In theory, as long as you choose a suitable learning rate and give training enough steps, the starting base score shouldn't affect the result. See the author's answer in the issue referenced below.

Reference: https://github.com/dmlc/xgboost/issues/799
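
To make the 300-out-of-1,000 example concrete, here is a minimal sketch of computing the prior and seeing what it corresponds to on the log-odds scale. It assumes a binary classification setup with the binary:logistic objective, where (as far as I understand) base_score is given as a probability and converted to log-odds internally. The helper names prior_base_score and logit are made up for illustration, and the params dictionary is only built here, not passed to xgboost.

```python
import math


def prior_base_score(n_positive: int, n_total: int) -> float:
    """Fraction of positive instances, i.e. the prior probability of the event."""
    if not 0 < n_positive < n_total:
        raise ValueError("need at least one positive and one negative instance")
    return n_positive / n_total


def logit(p: float) -> float:
    """Log-odds of a probability p in (0, 1)."""
    return math.log(p / (1.0 - p))


# 300 positives among 1,000 observations, as in the question.
base_score = prior_base_score(300, 1000)

# Assumed usage: a parameter dictionary of the kind passed to xgboost.train.
# It is only constructed here; no model is trained.
params = {"objective": "binary:logistic", "base_score": base_score}

print(base_score)          # prior probability of the positive class
print(logit(base_score))   # negative, because positives are the minority
print(logit(0.5))          # the default 0.5 corresponds to a starting log-odds of zero
```

With the default of 0.5 the model starts from even odds for every instance, so on imbalanced data the first trees spend their effort moving the predictions toward the class prior. Starting from the prior removes that initial work, which matches the point above that it helps the learning process without changing the result in the limit.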



Source: https://stackoverflow.com/questions/47596486/xgboost-the-meaning-of-the-base-score-parameter
