Calculate summary statistics of columns in dataframe

后端 未结 3 1810
萌比男神i
萌比男神i 2020-12-07 20:38

I have a dataframe of the following form (for example)

shopper_num,is_martian,number_of_items,count_pineapples,birth_country,tranpsortation_method
1,FALSE,0,         


        
3条回答
  •  孤街浪徒
    2020-12-07 20:58

    Now there is the pandas_profiling package, which is a more complete alternative to df.describe().

    If your pandas dataframe is df, the below will return a complete analysis including some warnings about missing values, skewness, etc. It presents histograms and correlation plots as well.

    import pandas_profiling
    pandas_profiling.ProfileReport(df)
    

    See the example notebook detailing the usage.

提交回复
热议问题