问题
I have 2 data frames train and label. The data frame train has 784 rows and 20K columns. The data frame label has 1 row and 20K columns. Each i column in label corresponds to i column in train. train is something like:
---->--- 20K Columns ---->
0 0 0 0 ... 3
1 0 . . ... .
4 0
9 7
. .
. .
. .
1 4
So for each i column where i belongs to {1,20K} there is a corresponding label in the label data frame which is something like:
---->----20K columns----->
0 -1 3 4 5 8 0 -5 -9 1 2 ....
The first column in train corresponds to the first column in label, second column in train corresponds to the second column in label and so on.
Now, I want to shuffle the train data frame. But if I shuffle train, the order with label will get lost. Is there a way where I could shuffle train data frame while maintaining order with label?
回答1:
Shuffle an ordering vector, and use that to order both objects.
shuffle <- sample(ncol(label))
label <- label[,shuffle]
train <- train[,shuffle]
An example with mtcars:
#create the label data frame
label <- data.frame(as.list(names(mtcars)), stringsAsFactors = FALSE)
label
# X.mpg. X.cyl. X.disp. X.hp. X.drat. X.wt. X.qsec. X.vs. X.am. X.gear. X.carb.
# 1 mpg cyl disp hp drat wt qsec vs am gear carb
shuffle <- sample(ncol(label))
mtcars <- mtcars[,shuffle]
label <- label[,shuffle]
label
# X.carb. X.wt. X.hp. X.cyl. X.mpg. X.gear. X.vs. X.am. X.drat. X.disp. X.qsec.
# 1 carb wt hp cyl mpg gear vs am drat disp qsec
head(mtcars)
# carb wt hp cyl mpg gear vs am drat disp qsec
# Mazda RX4 4 2.620 110 6 21.0 4 0 1 3.90 160 16.46
# Mazda RX4 Wag 4 2.875 110 6 21.0 4 0 1 3.90 160 17.02
# Datsun 710 1 2.320 93 4 22.8 4 1 1 3.85 108 18.61
# Hornet 4 Drive 1 3.215 110 6 21.4 3 1 0 3.08 258 19.44
# Hornet Sportabout 2 3.440 175 8 18.7 3 0 0 3.15 360 17.02
# Valiant 1 3.460 105 6 18.1 3 1 0 2.76 225 20.22
A more direct approach would be to rbind the two data frames, but I assumed you have them as separate objects for a reason.
来源:https://stackoverflow.com/questions/49584310/shuffle-a-data-frame-while-maintaining-order-with-another-data-frame