These functions are used to set the default skimming functions for a data
type. They are combined with the base skim function list (sfl) in
skim_with(), to create the summary tibble for each type.
Usage
get_skimmers(column)
# Default S3 method
get_skimmers(column)
# S3 method for class 'numeric'
get_skimmers(column)
# S3 method for class 'factor'
get_skimmers(column)
# S3 method for class 'character'
get_skimmers(column)
# S3 method for class 'logical'
get_skimmers(column)
# S3 method for class 'complex'
get_skimmers(column)
# S3 method for class 'Date'
get_skimmers(column)
# S3 method for class 'POSIXct'
get_skimmers(column)
# S3 method for class 'difftime'
get_skimmers(column)
# S3 method for class 'Timespan'
get_skimmers(column)
# S3 method for class 'ts'
get_skimmers(column)
# S3 method for class 'list'
get_skimmers(column)
# S3 method for class 'AsIs'
get_skimmers(column)
# S3 method for class 'haven_labelled'
get_skimmers(column)
modify_default_skimmers(skim_type, new_skim_type = NULL, new_funs = list())Details
When creating your own set of skimming functions, call sfl() within a
get_skimmers() method for your particular type. Your call to sfl() should
also provide a matching class in the skim_type argument. Otherwise, it
will not be possible to dynamically reassign your default functions when
working interactively.
Call get_default_skimmers() to see the functions for each type of summary
function currently supported. Call get_default_skimmer_names() to just see
the names of these functions. Use modify_default_skimmers() for a method
for changing the skim_type or functions for a default sfl. This is useful
for creating new default sfl's.
Methods (by class)
get_skimmers(default): The default method for skimming data. Only used when a column's data type doesn't match currently installed types. Call get_default_skimmer_names to see these defaults.get_skimmers(numeric): Summary functions for numeric columns, covering bothdouble()andinteger()classes:mean(),sd(),quantile()andinline_hist().get_skimmers(factor): Summary functions for factor columns:is.ordered(),n_unique()andtop_counts().get_skimmers(character): Summary functions for character columns. Also, the default for unknown columns:min_char(),max_char(),n_empty(),n_unique()andn_whitespace().get_skimmers(logical): Summary functions for logical/ boolean columns:mean(), which produces rates for each value, andtop_counts().get_skimmers(complex): Summary functions for complex columns:mean().get_skimmers(Date): Summary functions forDatecolumns:min(),max(),median()andn_unique().get_skimmers(POSIXct): Summary functions forPOSIXctcolumns:min(),max(),median()andn_unique().get_skimmers(difftime): Summary functions fordifftimecolumns:min(),max(),median()andn_unique().get_skimmers(Timespan): Summary functions forTimespancolumns:min(),max(),median()andn_unique().get_skimmers(ts): Summary functions fortscolumns:min(),max(),median()andn_unique().get_skimmers(list): Summary functions forlistcolumns:n_unique(),list_min_length()andlist_max_length().get_skimmers(AsIs): Summary functions forAsIscolumns:n_unique(),list_min_length()andlist_max_length().get_skimmers(haven_labelled): Summary functions forhaven_labelledcolumns. Finds the appropriate skimmers for the underlying data in the vector.
Examples
# Defining default skimming functions for a new class, `my_class`.
# Note that the class argument is required for dynamic reassignment.
get_skimmers.my_class <- function(column) {
sfl(
skim_type = "my_class",
mean,
sd
)
}
# Integer and double columns are both "numeric" and are treated the same
# by default. To switch this behavior in another package, add a method.
get_skimmers.integer <- function(column) {
sfl(
skim_type = "integer",
p50 = ~ stats::quantile(
.,
probs = .50, na.rm = TRUE, names = FALSE, type = 1
)
)
}
x <- mtcars[c("gear", "carb")]
class(x$carb) <- "integer"
skim(x)
#> ── Data Summary ────────────────────────
#> Values
#> Name x
#> Number of rows 32
#> Number of columns 2
#> _______________________
#> Column type frequency:
#> numeric 2
#> ________________________
#> Group variables None
#>
#> ── Variable type: numeric ──────────────────────────────────────────────────────
#> skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
#> 1 gear 0 1 3.69 0.738 3 3 4 4 5 ▇▁▆▁▂
#> 2 carb 0 1 2.81 1.62 1 2 2 4 8 ▇▂▅▁▁
if (FALSE) { # \dontrun{
# In a package, to revert to the V1 behavior of skimming separately with the
# same functions, assign the numeric `get_skimmers`.
get_skimmers.integer <- skimr::get_skimmers.numeric
# Or, in a local session, use `skim_with` to create a different `skim`.
new_skim <- skim_with(integer = skimr::get_skimmers.numeric())
# To apply a set of skimmers from an old type to a new type
get_skimmers.new_type <- function(column) {
modify_default_skimmers("old_type", new_skim_type = "new_type")
}
} # }
