问题
I wonder if there is an easy way to calculate the duration. I have a dataset where the a parameter, called m
, varies between the values -1
and 1
during time. I want to calculate:
- The total duration (time in hours) of cases where
m=-1
andm=1
respectively How long is each period of cases where
m=-1
andm=1
respectively ism<-c(1,1,1,1,-1,-1,-1,-1,1,1,1,1,1,-1,-1,-1,-1,-1,1,1,1,1,1,1,1)
Time <- seq.POSIXt(as.POSIXct(Sys.Date()), as.POSIXct(Sys.Date()+1), by = "1 hour")
回答1:
I'd use package data.table for "split-apply-combine" and identify the runs using cumsum
and diff
:
DF <- read.table(text="Time, m
2015-01-01 00:00, -1
2015-01-01 01:00, -1
2015-01-01 02:00, -1
2015-01-01 03:00, 1
2015-01-01 04:00, 1
2015-01-01 05:00, 1
2015-01-01 06:00, 1
2015-01-01 07:00, 1
2015-01-01 08:00, -1
2015-01-01 09:00, -1
2015-01-01 10:00, -1
2015-01-01 11:00, -1
2015-01-01 12:00, 1
2015-01-01 13:00, 1
2015-01-01 14:00, 1
2015-01-01 15:00, -1", header = TRUE, sep =",")
library(data.table)
setDT(DF)
DF[, Time := as.POSIXct(Time, format = "%Y-%m-%d %H:%M", tz = "GMT")]
DF[, run := cumsum(c(1, diff(m) != 0))]
DF1 <- DF[, list(m = unique(m),
duration = difftime(max(Time), min(Time), unit = "min")),
by = run]
# run m duration
#1: 1 -1 120 mins
#2: 2 1 240 mins
#3: 3 -1 180 mins
#4: 4 1 120 mins
#5: 5 -1 0 mins
DF1[, sum(duration), by = m]
# m V1
#1: -1 300
#2: 1 360
来源:https://stackoverflow.com/questions/29005218/calculate-duration-in-r