Is there a way to use an apply type construct directly to the columns of a ffdf object? I am trying to count the NAs in each column without having to turn it into a standar
An ffdf is basically a list with elements "virtual", "physical", "row.names". If you do an lapply over the physical element, you have what you want.
require(ffbase)
myffdf <- as.ffdf(iris)
lapply(physical(myffdf), FUN=function(x) sum(is.na(x)))
As is.na and sum is generic, this will basically use is.na.ff
and sum.ff
from package ffbase such that data is loaded into RAM chunkwise according to what your computer can handle.