The data frames produced by skim()
are wide and sparse, filled with
columns that are mostly NA
. For that reason, it can be convenient to
work with "by type" subsets of the original data frame. These smaller
subsets have their NA
columns removed.
Details
partition()
creates a list of smaller skim_df
data frames. Each entry
in the list is a data type from the original skim_df
. The inverse of
partition()
is bind()
, which takes the list and produces the original
skim_df
. While partition()
keeps all of the subtables as list entries,
yank()
gives you a single subtable for a data type.
Functions
bind()
: The inverse of apartition()
. Rebuild the originalskim_df
.yank()
: Extract a subtable from askim_df
with a particular type.
Examples
# Create a wide skimmed data frame (a skim_df)
skimmed <- skim(iris)
# Separate into a list of subtables by type
separate <- partition(skimmed)
# Put back together
identical(bind(separate), skimmed)
#> [1] TRUE
# > TRUE
# Alternatively, get the subtable of a particular type
yank(skimmed, "factor")
#>
#> ── Variable type: factor ───────────────────────────────────────────────────────
#> skim_variable n_missing complete_rate ordered n_unique top_counts
#> 1 Species 0 1 FALSE 3 set: 50, ver: 50, vir:…