问题
I've got a csv with some results. I want to loop over the results, run a power.prop.test and output the result for each row in a new column. So far I've got this:
data <- read.csv("Downloads/data.csv", sep = ",", header = TRUE)
for (i in 1:nrow(data)) {
n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.1, power = .8, alternative = "one.sided")
data <- cbind(data, n[1])
}
head(data)
Rather than populating one column with the output, I'm looping through and creating a new column for ever power.prop.test I'm running. I'm binding a new column for each output instead of populating one column with each output. Issue is I'm not sure how to achieve the latter.
If anyone has any advice on how to consolidate these outputs into one column that would be great.
Thanks!
回答1:
Try this:
data <- read.csv("Downloads/data.csv", sep = ",", header = TRUE)
data$newcolumn <- 0
for (i in 1:nrow(data)) {
n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.1, power = .8, alternative = "one.sided")
data$newcolumn[i] <- n
}
head(data)
I just added a new column, filled it with zeroes, and then added in the power.prop.test values one at a time as they are calculated.
回答2:
Thanks for all the help! Here's the solution I'm using for now:
# Read in csv of results
data <- read.csv("Downloads/data.csv")
data$obs=0
data$obs2=0
data$sig=0
# Create a loop to calculate required observations for each test based on CR, Significance and Statistical Power
for (i in 1:nrow(data)) {
# Error handling: where CR are the same, cannot calculate n and fails, we skip these tests
possibleError <- tryCatch(
n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.2, power = .8, alternative = "one.sided")
,error=function(e)e
)
if(!inherits(possibleError, "error")){
# Real work: calculate n, determine if bigger than actual observations, if not assign sig = 0, otherwise =1
n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.2, power = .8, alternative = "one.sided")
data[i,'obs'] = n[[1]]
data$obs2[i] = data$obs[i] * 2
if(data$obs[i]*2 < data$Traffic[i]) {
data$sig[i] <- 1
} else {
data$sig[i] <-0
}
}
}
# End for loop
write.csv(data,file = "dataClean.csv")
This runs the power.prop.test on each row, includes error handling and a condition to tell you whether you have enough observations. I'm sure there are more efficient ways to write this, so I'll review your comments to see if I can incorporate them.
回答3:
Looks like a job for apply
. To avoid any possible interference with factor or character variables, I'm just submitting the two columns as the argument.
data$power.col <- apply(data[5:6], 1,
function(x) power.prop.test, p1 = x[1], p2 = x[2],
sig.level=.1, power = .8, alternative = "one.sided")
}
回答4:
First of all, I should say that I have never heard of power-prop tests. But then again, your question really isn't about the statistics, right?
Secondly, there is one package that is able to add columns to a data.frame, namely dplyr
.
To quote the documentation:
mutate(mtcars, displ_l = disp / 61.0237)
- "Add a column displ_l (to mtcars) containing the values from the disp column divided by 61.0237."
来源:https://stackoverflow.com/questions/24619783/for-loop-r-create-and-populate-new-column-with-output