R and shared memory for parallel::mclapply

死守一世寂寞 2020-12-16 20:20

I am trying to take advantage of a quad-core machine by parallelizing a costly operation that is performed on a list of about 1000 items.

I am using R's parallel::mclapply …

3 Answers
  •  被撕碎了的回忆
    2020-12-16 21:11

    I would have thought this would not use extra memory, because of fork()'s copy-on-write behaviour. I take it the elements of the list are large? Perhaps when R passes the elements to fun() it actually makes a copy of each list item instead of relying on copy-on-write. If so, the following might work better:

    library(plyr)  # provides rbind.fill()

    fun <- function(itemNumber){
      # look the item up inside the worker: only the integer index is passed in
      myitem <- lst[[itemNumber]]
      # now do your computations on myitem and return a data frame
    }
    res <- rbind.fill(parallel::mclapply(seq_along(lst), fun,
                                         mc.cores = 3, mc.preschedule = TRUE))
    

    Or use lst[[itemNumber]] directly in your function (a minimal sketch follows). If R on Linux/macOS isn't smart enough to use copy-on-write the way you wrote the function, it may be with this modified approach.
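    For instance, something like this, where the data.frame() built from sum() is only a placeholder for your real computation:

    # sketch: reference lst[[itemNumber]] in place instead of binding it to a local name
    fun <- function(itemNumber){
      # placeholder computation; swap in whatever you actually do with the item
      data.frame(item = itemNumber, total = sum(unlist(lst[[itemNumber]])))
    }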

    Edit: I assume you are not modifying the items in the list. If you do, R is going to make copies of the data.
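
    Roughly, and only as an illustration (the item[[1]] write below is a placeholder for whatever you might modify): a read-only worker keeps sharing the parent's memory pages, while a worker that writes to the item ends up holding its own private copy inside that child process:

    # read-only: the forked child keeps sharing the parent's pages
    fun_read <- function(i) sum(unlist(lst[[i]]))

    # writing: R's copy-on-modify makes the child duplicate the item's data
    fun_write <- function(i) {
      item <- lst[[i]]
      item[[1]] <- 0          # placeholder modification
      sum(unlist(item))
    }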
