Counting non zero values in each column of a dataframe in python

前端 未结 3 1159
温柔的废话
温柔的废话 2020-12-07 22:54

I have a python-pandas-dataframe in which first column is user_id and rest of the columns are tags(tag_0 to tag_122). I have the data in the following format:



        
3条回答
  •  一个人的身影
    2020-12-07 23:10

    My favorite way of getting number of nonzeros in each column is

    df.astype(bool).sum(axis=0)
    

    For the number of non-zeros in each row use

    df.astype(bool).sum(axis=1)
    

    (Thanks to Skulas)

    If you have nans in your df you should make these zero first, otherwise they will be counted as 1.

    df.fillna(0).astype(bool).sum(axis=1)
    

    (Thanks to SirC)

提交回复
热议问题