Large Matrices in R: long vectors not supported yet

前端 未结 3 2288
Happy的楠姐
Happy的楠姐 2020-12-06 00:53

I am running 64 bit R 3.1 in a 64bit Ubuntu environment with 400GB of RAM, and I am encountering a strange limitation when dealing with large matrices.

I have a num

相关标签:
3条回答
  • 2020-12-06 01:03

    A matrix is just an atomic vector with a dimension attribute which allows R to access it as a matrix. Your matrix is a vector of length 4000*9000000 which is 3.6e+10 elements (the largest integer value is approx 2.147e+9). Subsetting a long vector is supported for atomic vectors (i.e. accessing elements beyond the 2.147e+9 limit). Just treat your matrix as a long vector.

    If we remember that by default R fills matrices column-wise then if we wanted to retrieve say the value at test[ 2701 , 850000 ] we could access it via:

    i <- ( 2701 - 1 ) * 850000 + 2701 
    test[i]
    #[1] 1
    

    Note that this really is long vector subsetting because:

    2701L * 850000L
    #[1] NA
    #Warning message:
    #In 2701L * 850000L : NAs produced by integer overflow
    
    0 讨论(0)
  • 2020-12-06 01:21

    An alternate, quick-hand solution would be to first get the row and then the column (now the i'th element of the resulting vector) of the matrix. For example ...

    test <- matrix(1,4000,900000) #no error 
    test[1,1] #error
    test[1, ][1] # no error
    

    Of course, this produces some overhead, as the whole row is copied/accessed first, but it's more straightforward to read. Also works for first extracting the column and then the row.

    0 讨论(0)
  • 2020-12-06 01:24

    library(knitr)

    knitr::option$set(cache = TRUE, warning = FALSE,message = FALSE, cache.lazy = FALSE)

    0 讨论(0)
提交回复
热议问题