Can .SD be viewed from a browser within [.data.table()?

前端 未结 1 1459
青春惊慌失措
青春惊慌失措 2020-12-15 06:30

While constructing expressions to put in the j-slot of a [.data.table call, it would often be helpful to be able to examine and play around with th

相关标签:
1条回答
  • 2020-12-15 06:57

    Updated in light of Matthew Dowle's comments:

    It turns out that .SD is, internally, the environment within which all j expressions are evaluated, including those which don't explicitly reference .SD at all. Filling it with all of DT's columns for each subset of DT is not cheap, timewise, so [.data.table() won't do so unless it really needs to.

    Instead, making great use of R's lazy-evaluation of arguments, it previews the unevaluated j expression, and only adds to .SD columns that are referenced therein. If .SD itself is mentioned, it adds all of DT's columns.

    So, to view .SD, just include some reference to it in the j-expression. Here is one of many expressions that will work:

    library(data.table)
    DT = data.table(x=rep(c("a","b","c"),each=3), y=c(1,3,6), v=1:9)
    
    ## This works
    DT[, if(nrow(.SD)) browser(), by=x]
    # Called from: `[.data.table`(DT, , if (nrow(.SD)) browser(), by = x)
    Browse[1]> .SD
    #    y v
    # 1: 1 1
    # 2: 3 2
    # 3: 6 3
    

    And here are a couple more:

    DT[,{.SD; browser()}, by=x]
    DT[,{browser(); .SD}, by=x]  ## Notice that order doesn't matter
    

    To see for yourself that .SD just loads columns needed by the j-expression, run these each in turn (typing .SD when entering the browser environment, and Q to leave it and return to the normal command-line):

    DT[, {.N * y ; browser()}, by=x]
    DT[, {v^2 ; browser()}, by=x]
    DT[, {y*v ; browser()}, by=x]
    
    0 讨论(0)
提交回复
热议问题