Inspecting and visualizing gaps/blanks and structure in large dataframes

遥遥无期 2021-01-02 21:51

I have a large dataframe (400000 x 50) that I want to visually inspect for structure and blanks/gaps.

Is there an existing library or ggplot2 function that can spit

4 Answers
  •  忘掉有多难
    2021-01-02 22:48

    You may want to have a look at the tabplot package. With such a big data.frame it will take a while to load, but it should also correctly identify missing values. More info here.

    Here's an example image using the diamonds data.frame.

    [Image: tabplot_diamonds]
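
A plot like the one described above could be produced roughly as follows. This is only a sketch: it assumes the tabplot and ggplot2 packages are installed, and the missing prices are injected artificially (they are not in the real diamonds data) just so there is something to see.

```r
# Assumption: ggplot2 (for the diamonds data) and tabplot are installed.
library(ggplot2)
library(tabplot)

# Work on a plain data.frame copy of diamonds
d <- as.data.frame(diamonds)

# Artificially blank out 5000 prices to simulate gaps (illustration only)
set.seed(42)
d$price[sample(nrow(d), 5000)] <- NA

# Sort rows by carat and aggregate them into 100 row bins;
# columns with missing values should show them per bin
tableplot(d, sortCol = carat, nBins = 100)
```

Sorting by a different column (sortCol) often reveals whether the gaps cluster in a particular range of the data.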

    EDIT

    I just saw that you said your df has 50 columns. I've used tabplot on data frames that size and find that the resolution of the plot is limited by the screen width. The row count can also be an issue, but I personally find that more information is lost if the data frame is too wide. So I'd suggest splitting it into 3 separate data frames (for example using dplyr) and then running each one through the tableplot() function of tabplot or similar.
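
The column split suggested above might look something like this. It is a sketch under assumptions: `df` here is a small made-up stand-in for the real 400000 x 50 data frame, the three-way split is by column position, and base R subsetting is used to keep it dependency-free (`dplyr::select(df, all_of(cols))` would do the same job). The plotting step only runs if tabplot is available.

```r
# Hypothetical stand-in for the real data frame (200 rows x 50 columns)
set.seed(1)
df <- as.data.frame(matrix(rnorm(200 * 50), ncol = 50))

# Assign each column position to one of three contiguous groups
groups <- cut(seq_along(df), breaks = 3, labels = FALSE)
col_groups <- split(names(df), groups)

# Build one narrower data frame per group of columns
parts <- lapply(col_groups, function(cols) df[, cols, drop = FALSE])

# Plot each part separately, if tabplot is installed
if (requireNamespace("tabplot", quietly = TRUE)) {
  for (part in parts) {
    tabplot::tableplot(part)
  }
}
```

Grouping related columns together, rather than splitting purely by position, may make each plot easier to read.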
