Estimating unknown response variable in JAGS - unsupervised learning

不问归期 提交于 2019-12-11 11:34:10

问题


I am attempting to estimate response values of percentage cover (COV) from known distribution parameters. I can do this by specifying the response data as NAs in OpenBUGS (e.g. the code below) but JAGS won't allow this. Does anyone know how I can achieve this in JAGS?

I think this falls into the category of 'unsupervised statistical learning'

model {
  for (i in 1:n.sites) {  # loop around sites
    # specify prior distribution forms for effectively unknown percentage  cover
    COV[i] ~ dbeta(a[i], b[i])T(r1[i], r2[i]) 
  }  
}

# DATA
list(n.sites=5, COV=c(NA, NA, NA, NA, NA), a=c(7.1,2.2,13,10,13),
     b=c(25,11,44,27,44), r1=c(0.05,0.1,0.2,0.1,0.2),
     r2=c(0.15,0.3,0.6,0.3,0.6) )

# INITS
list(COV=c(0.1, 0.2, 0.4, 0.2, 0.4))

回答1:


If you just want to simulate values for COV that are consistent with your truncated beta distribution, then you can omit COV from your list of data. For example:

library(R2jags)

cat('
model {
  for (i in 1:n.sites) {
    COV[i] ~ dbeta(a[i], b[i])T(r1[i], r2[i]) 
  }  
}', file={M <- tempfile()})


dat <- list(n.sites=5, a=c(7.1, 2.2, 13, 10, 13), b=c(25, 11, 44, 27, 44), 
            r1=c(0.05, 0.1, 0.2, 0.1, 0.2), r2=c(0.15, 0.3, 0.6, 0.3, 0.6))

j <- jags(dat, NULL, 'COV', M, 1, 10000, DIC=FALSE, n.burnin=0, n.thin=1)

Here are the top few rows of the matrix of 10000 simulated COV vectors...

head(j$BUGSoutput$sims.list$COV)

##           [,1]      [,2]      [,3]      [,4]      [,5]
## [1,] 0.1169165 0.2889155 0.2543063 0.2083161 0.2426788
## [2,] 0.1494647 0.1430956 0.2867575 0.2410594 0.2795923
## [3,] 0.1200414 0.2093230 0.2736719 0.2189734 0.2469634
## [4,] 0.1472082 0.1442609 0.2911482 0.2625216 0.2714883
## [5,] 0.1403574 0.1100977 0.2556352 0.1918480 0.2353231
## [6,] 0.1310404 0.1677148 0.3011752 0.1974136 0.2131811

EDIT

Since you're just sampling from a known distribution, you can also do this in pure R (the distr package provides some functions that help with sampling from truncated distributions).

library(distr)
n <- 10000 # How many samples?
COV <- mapply(function(shape1, shape2, min, max) {
  r(Truncate(Beta(shape1, shape2), min, max))(n)
}, shape1=c(7.1, 2.2, 13, 10, 13), shape2=c(25, 11, 44, 27, 44), 
   min=c(0.05, 0.1, 0.2, 0.1, 0.2), max=c(0.15, 0.3, 0.6, 0.3, 0.6))

Above, you should pass equal-length vectors of shape1, shape2, min and max to mapply, which will generate n random beta-distributed variates for each shape1 and it's corresponding shape2, min and max in turn.

We can compare the kernel densities of the columns of COV (our pure-R samples) to those of the columns of j$BUGSoutput$sims.list$COV (our JAGS samples).

par(mfrow=c(3, 2), mar=c(3, 0.5, 0.5, 0.5))
sapply(1:5, function(i) {
  djags <- density(j$BUGSoutput$sims.list$COV[, i])
  dr <- density(COV[, i])
  plot(djags, lwd=4, col='gray80', main='', ylab='', xlab='', yaxt='n', 
       ylim=c(0, max(djags$y, dr$y)))
  lines(dr)
})
plot.new()
legend('topleft', c('JAGS', 'R'), col=c('gray80', 'black'), lwd=c(4, 1), bty='n')



来源:https://stackoverflow.com/questions/28533934/estimating-unknown-response-variable-in-jags-unsupervised-learning

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!