Grouped line plots in Plotly R: how to control line color?

纵然是瞬间 提交于 2019-12-29 09:21:07

问题


I have a bunch of 'paired' observations from a study for the same subject, and I am trying to build a spaghetti plot to visualize these observations as follows:

library(plotly)
df <- data.frame(id = rep(1:10, 2),
                 type = c(rep('a', 10), rep('b', 10)),
                 state = rep(c(0, 1), 10),
                 values = c(rnorm(10, 2, 0.5), rnorm(10, -2, 0.5)))
df <- df[order(df$id), ]
plot_ly(df, x = type, y = values, group = id, type = 'line') %>%
  layout(showlegend = FALSE)

It produces the correct plot I am seeking. But, the code shows each grouped line in own color, which is really annoying and distracting. I can't seem to find a way to get rid of colors.

Bonus question: I actually want to use color = state and actually color the sloped lines by that variable instead.

Any approaches / thoughts?


回答1:


You can set the lines to the same colour like this

plot_ly(df, x = type, y = values, group = id, type = 'scatter', mode = 'lines+markers', 
        line=list(color='#000000'), showlegend = FALSE)

For the 'bonus' two-for-the-price-of-one question 'how to color by a different variable to the one used for grouping':

If you were only plotting markers, and no lines, this would be simple, as you can simply provide a vector of colours to marker.color. Unfortunately, however, line.color only takes a single value, not a vector, so we need to work around this limitation.

Provided the data are not too numerous (in which case this method becomes slow, and a faster method is given below), you can set colours of each line individually by adding them as separate traces one by one in a loop (looping over id)

p <- plot_ly()
for (id in df$id) {
  col <- c('#AA0000','#0000AA')[df[which(df$id==id),3][1]+1] # calculate color for this line based on the 3rd column of df (df$state).
  p <- add_trace(data=df[which(df$id==id),], x=type, y=values, type='scatter', mode='markers+lines',
                 marker=list(color=col),
                 line=list(color=col), 
                 showlegend = FALSE,
                 evaluate=T)
  }
p

Although this one-trace-per-line approach is probably the simplest way conceptually, it does become very (impractically) slow if applied to hundreds or thousands of line segments. In this case there is a faster method, which is to plot only one line per colour, but to split this line up into multiple segments by inserting NA's between the separate segments and using the connectgaps=FALSE option to break the line into segments where there are missing data.

Begin by using dplyr to insert missing values between line segements (i.e. for each unique id we add a row containing NA in the columns that provide x and y coordinates).

library(dplyr)
df %<>% distinct(id) %>%
  `[<-`(,c(2,4),NA) %>%
  rbind(df) %>%
  arrange (id)

and plot, using connectgaps=FALSE:

plot_ly(df, x = type, y = values, group = state, type = 'scatter', mode = 'lines+markers', 
        showlegend = FALSE,
        connectgaps=FALSE)



来源:https://stackoverflow.com/questions/37923581/grouped-line-plots-in-plotly-r-how-to-control-line-color

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!