How to split data into training/testing sets using sample function

前端 未结 24 1799
猫巷女王i
猫巷女王i 2020-11-22 10:43

I\'ve just started using R and I\'m not sure how to incorporate my dataset with the following sample code:

sample(x, size, replace = FALSE, prob = NULL)
         


        
24条回答
  •  一生所求
    2020-11-22 11:09

    It can be easily done by:

    set.seed(101) # Set Seed so that same sample can be reproduced in future also
    # Now Selecting 75% of data as sample from total 'n' rows of the data  
    sample <- sample.int(n = nrow(data), size = floor(.75*nrow(data)), replace = F)
    train <- data[sample, ]
    test  <- data[-sample, ]
    

    By using caTools package:

    require(caTools)
    set.seed(101) 
    sample = sample.split(data$anycolumn, SplitRatio = .75)
    train = subset(data, sample == TRUE)
    test  = subset(data, sample == FALSE)
    

提交回复
热议问题