For loop R create and populate new column with output

心已入冬 提交于 2019-12-10 13:52:43

问题


I've got a csv with some results. I want to loop over the results, run a power.prop.test and output the result for each row in a new column. So far I've got this:

data <- read.csv("Downloads/data.csv", sep = ",", header = TRUE)

for (i in 1:nrow(data)) {
  n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.1, power = .8, alternative = "one.sided")
  data <- cbind(data, n[1])
}
head(data)

Rather than populating one column with the output, I'm looping through and creating a new column for ever power.prop.test I'm running. I'm binding a new column for each output instead of populating one column with each output. Issue is I'm not sure how to achieve the latter.

If anyone has any advice on how to consolidate these outputs into one column that would be great.

Thanks!


回答1:


Try this:

data <- read.csv("Downloads/data.csv", sep = ",", header = TRUE)

data$newcolumn <- 0

for (i in 1:nrow(data)) {
  n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.1, power = .8, alternative = "one.sided")
  data$newcolumn[i] <- n
}
head(data)

I just added a new column, filled it with zeroes, and then added in the power.prop.test values one at a time as they are calculated.




回答2:


Thanks for all the help! Here's the solution I'm using for now:

# Read in csv of results
data <- read.csv("Downloads/data.csv")
data$obs=0
data$obs2=0
data$sig=0

# Create a loop to calculate required observations for each test based on CR,     Significance and Statistical Power
for (i in 1:nrow(data)) {
# Error handling: where CR are the same, cannot calculate n and fails, we skip these tests
  possibleError <- tryCatch(
      n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.2, power = .8, alternative = "one.sided")
  ,error=function(e)e
  )
  if(!inherits(possibleError, "error")){
# Real work: calculate n, determine if bigger than actual observations, if not assign   sig = 0, otherwise =1
   n <- power.prop.test( p1 = data[i,5], p2 = data[i,6], sig.level=.2, power = .8,  alternative = "one.sided")
    data[i,'obs'] = n[[1]]
    data$obs2[i] = data$obs[i] * 2
    if(data$obs[i]*2 < data$Traffic[i]) {
      data$sig[i] <- 1
    } else {
      data$sig[i] <-0
    }
    }
    }    
# End for loop

write.csv(data,file = "dataClean.csv")

This runs the power.prop.test on each row, includes error handling and a condition to tell you whether you have enough observations. I'm sure there are more efficient ways to write this, so I'll review your comments to see if I can incorporate them.




回答3:


Looks like a job for apply. To avoid any possible interference with factor or character variables, I'm just submitting the two columns as the argument.

data$power.col <- apply(data[5:6], 1,  
                      function(x) power.prop.test, p1 = x[1], p2 = x[2], 
                         sig.level=.1, power = .8, alternative = "one.sided")
      }



回答4:


First of all, I should say that I have never heard of power-prop tests. But then again, your question really isn't about the statistics, right?

Secondly, there is one package that is able to add columns to a data.frame, namely dplyr.

To quote the documentation: mutate(mtcars, displ_l = disp / 61.0237) - "Add a column displ_l (to mtcars) containing the values from the disp column divided by 61.0237."



来源:https://stackoverflow.com/questions/24619783/for-loop-r-create-and-populate-new-column-with-output

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!