Interpretation of “stat_summary = mean_cl_boot” at ggplot2?

后端 未结 2 1610
暖寄归人
暖寄归人 2020-12-29 09:49

a perhaps simple question I tried to make an errorgraph like the one shown in page 532 of Field\'s \"Discovering Statistics Using R\".

The code can be found here htt

2条回答
  •  慢半拍i
    慢半拍i (楼主)
    2020-12-29 10:30

    I reproduced the graph using your code and I get essentially the same graph shown in Field's book, Discovering Statistics Using R, figure 12.12, page 532, except for the ordering of the variables on the x axis. The y axis displays the continuous variable, Mean Attractiveness of Date (%). The 95% confidence intervals, created--as you point out--with the stat_summary() function and the mean_cl_boot argument are bootstrap confidence intervals using the smean.cl.boot() function in Hmisc, as pointed out by another commenter above. This function is described on page 262 of the Hmisc documentation. The ggplot2 documentation on mean_cl_boot is sparse and defers to the description in the Hmisc package.

    Note that the arguments to mean_cl_boot in ggplot2 are the same as those in the smean.cl.boot function in the Hmisc package. You can change the desired confidence level from the default of .95 by using the conf.int argument and the number of bootstrap samples by using the B argument. Here, for example, is the code for creating the same plot with a 99% confidence interval and 5000 bootstrap samples:

    line <- ggplot(gogglesData, aes(alcohol, attractiveness, colour = gender))
    line + stat_summary(fun.y = mean, geom = "point") + 
    stat_summary(fun.y = mean, geom = "line", aes(group= gender)) + 
    stat_summary(fun.data = mean_cl_boot, conf.int = .99, B = 5000, geom = "errorbar", width = 0.2) + 
    labs(x = "Alcohol Consumption", y = "Mean Attractiveness of Date (%)", colour = "Gender") 
    

提交回复
热议问题