Grouping on identical column names in pandas

雨燕双飞 提交于 2019-12-08 08:39:31

问题


time       A1    A1    A2    A2     A2    A3   A3 
2017-01    a1    a2    b1    b2      c    .....
2017-02    a3    a4    b3    b4      c
2017-03    a5    a6    b5    b6      c
....

There is a dataframe as shown above. How to get mean value of the columns which have the same name( as shown below)?

time            A1             A2           A3
2017-01    (a1+a2)/2       (b1+b2+c)/3      c
2017-02     .....
2017-03 

回答1:


Use groupby with level=0 and axis=1.

df.groupby(level=0, axis=1).mean()

np.random.seed(0)
df = pd.DataFrame(np.random.choice(10, (3, 5)), columns=list('AAABB'))

df

   A  A  A  B  B
0  5  0  3  3  7
1  9  3  5  2  4
2  7  6  8  8  1

df.groupby(level=0, axis=1).mean()

          A    B
0  2.666667  5.0
1  5.666667  3.0
2  7.000000  4.5


来源:https://stackoverflow.com/questions/53940035/grouping-on-identical-column-names-in-pandas

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!