How to get this data structure in R?

后端 未结 2 1316
轮回少年
轮回少年 2020-12-11 13:28

I am trying to find Wanted data structure from the current data structure. I know the schematics of the expected data structure partially. The wanted data structure includ

相关标签:
2条回答
  • 2020-12-11 14:00

    If you don't insist on M1, M2, etc. as column names, there is an even shorter data.table solution:

    library(data.table)   # CRAN version 1.10.4 used
    as.data.table(dat.m, keep.rownames = "Vars")
    #      Vars  V1 V2
    #1: ave_max 150 61
    #2:     ave  60  0
    #3:    lepo  41  0
    

    If you do insist on M1, M2, etc. as column names and your matrix dat.m has many columns, the columns can be renamed:

    DT <- as.data.table(dat.m, keep.rownames = "Vars")
    setnames(DT, stringr::str_replace(names(DT), "^V(?=\\d+$)", "M"))
    DT
    #      Vars  M1 M2
    #1: ave_max 150 61
    #2:     ave  60  0
    #3:    lepo  41  0
    

    The regular expression uses a look-ahead assertion to ensure that only columns starting with V and immediately followed and ended by at least one digit are changed. Others like Vars, V, V17b, VV3 aren't touched.


    If your matrix has many columns and the purpose of your operation is not just to have nice column headers for printing, you may consider to reshape your data from wide to long form. The long form is preferred by ggplotfor instance.

    DT_long <- melt(as.data.table(dat.m, keep.rownames = "Vars"), id.vars = "Vars")
    DT_long
    #      Vars variable value
    #1: ave_max       V1   150
    #2:     ave       V1    60
    #3:    lepo       V1    41
    #4: ave_max       V2    61
    #5:     ave       V2     0
    #6:    lepo       V2     0
    

    In long form, it is often easier to manipulate your data, for instance, to rename the columns:

    DT_long[, variable := stringr::str_replace(variable, "^V", "M")]
    DT_long
    #      Vars variable value
    #1: ave_max       M1   150
    #2:     ave       M1    60
    #3:    lepo       M1    41
    #4: ave_max       M2    61
    #5:     ave       M2     0
    #6:    lepo       M2     0
    

    Finally, you can reshape from long to wide form again

    dcast(DT_long, Vars ~ ...)
    #      Vars  M1 M2
    #1:     ave  60  0
    #2: ave_max 150 61
    #3:    lepo  41  0
    

    Note that the cast formula recognizes two special variables: . and .... . represents no variable; ... represents all variables not otherwise mentioned in formula. (See ?data.table::dcast for details).

    0 讨论(0)
  • 2020-12-11 14:02

    We can use tidyverse

    library(tidyverse)
    dat.m %>% 
        as.data.frame() %>% 
        rownames_to_column('Vars') %>%
        rename(M1 = V1, M2 = V2)
    #     Vars  M1 M2
    #1 ave_max 150 61
    #2     ave  60  0
    #3    lepo  41  0
    

    If we need to use data.table

    library(data.table)
    setnames(setDT(as.data.frame(dat.m), keep.rownames = TRUE), c('Vars', 'M1', 'M2'))[]
    
    0 讨论(0)
提交回复
热议问题