Run the pipeline you defined in the targets
script file (default: _targets.R). tar_make()
runs the correct targets in the correct order and stores the return
values in _targets/objects/. Use tar_read() to read a target
back into R, and see
https://docs.ropensci.org/targets/reference/index.html#clean
to manage output files.
Usage
tar_make(
names = NULL,
shortcut = targets::tar_config_get("shortcut"),
reporter = targets::tar_config_get("reporter_make"),
seconds_meta_append = targets::tar_config_get("seconds_meta_append"),
seconds_meta_upload = targets::tar_config_get("seconds_meta_upload"),
seconds_reporter = targets::tar_config_get("seconds_reporter"),
seconds_interval = targets::tar_config_get("seconds_interval"),
callr_function = callr::r,
callr_arguments = targets::tar_callr_args_default(callr_function, reporter),
envir = parent.frame(),
script = targets::tar_config_get("script"),
store = targets::tar_config_get("store"),
garbage_collection = NULL,
use_crew = targets::tar_config_get("use_crew"),
terminate_controller = TRUE,
as_job = targets::tar_config_get("as_job")
)Arguments
- names
Names of the targets to run or check. Set to
NULLto check/run all the targets (default). The object supplied tonamesshould be atidyselectexpression likeany_of()orstarts_with()fromtidyselectitself, ortar_described_as()to select target names based on their descriptions.- shortcut
Logical of length 1, how to interpret the
namesargument. IfshortcutisFALSE(default) then the function checks all targets upstream ofnamesas far back as the dependency graph goes.shortcut = TRUEincreases speed if there are a lot of up-to-date targets, but it assumes all the dependencies are up to date, so please use with caution. It relies on stored metadata for information about upstream dependencies.shortcut = TRUEonly works if you setnames.- reporter
Character of length 1, name of the reporter to user. Controls how messages are printed as targets run in the pipeline.
The default value of
reporteris the value returned bytar_config_get("reporter_make"). The default oftar_config_get("reporter_make")is"terse"if the calling R session is either:1. Non-interactive (`interactive()` returns `FALSE`), or 2. Inside a literate programming document (the `knitr.in.progress` global option is `TRUE`).Otherwise, the default is
"balanced". You can always set the reporter manually. Choices:"balanced": a reporter that balances efficiency with informative detail. Uses acliprogress bar instead of printing messages for individual dynamic branches. To the right of the progress bar is a text string like "22.6s, 4510+, 124-" (22.6 seconds elapsed, 4510 targets completed successfully so far, 124 targets skipped so far).For best results with the balanced reporter, you may need to adjust your
clisettings. See global optionscli.num_colorsandcli.dynamicat https://cli.r-lib.org/reference/cli-config.html. On that page is also theCLI_TICK_TIMEenvironment variable which controls the time delay between progress bar updates. If the delay is too low, then overhead from printing to the console may slow down the pipeline."terse": like the"balanced"reporter, but without a progress bar."silent": print nothing."timestamp": same as the"verbose"reporter except that each message begins with a time stamp."verbose": print messages for individual targets as they dispatch or complete. Each individual target-specific time (e.g. "3.487 seconds") is strictly the elapsed runtime of the target and does not include steps like data retrieval and output storage.
- seconds_meta_append
Positive numeric of length 1 with the minimum number of seconds between saves to the local metadata and progress files in the data store. his is an aggressive optimization setting not recommended for most users: higher values generally make the pipeline run faster, but unsaved work (in the event of a crash) is not up to date. When the pipeline ends, all the metadata and progress data is saved immediately, regardless of
seconds_meta_append.When the pipeline is just skipping targets, the actual interval between saves is
max(1, seconds_meta_append)to reduce overhead.- seconds_meta_upload
Positive numeric of length 1 with the minimum number of seconds between uploads of the metadata and progress data to the cloud (see https://books.ropensci.org/targets/cloud-storage.html). Higher values generally make the pipeline run faster, but unsaved work (in the event of a crash) may not be backed up to the cloud. When the pipeline ends, all the metadata and progress data is uploaded immediately, regardless of
seconds_meta_upload.- seconds_reporter
Deprecated on 2025-03-31 (
targetsversion 1.10.1.9010).- seconds_interval
Deprecated on 2023-08-24 (targets version 1.2.2.9001). Use
seconds_meta_appendandseconds_meta_uploadinstead.- callr_function
A function from
callrto start a fresh clean R process to do the work. Set toNULLto run in the current session instead of an external process (but restart your R session just before you do in order to clear debris out of the global environment).callr_functionneeds to beNULLfor interactive debugging, e.g.tar_option_set(debug = "your_target"). However,callr_functionshould not beNULLfor serious reproducible work.- callr_arguments
A list of arguments to
callr_function.- envir
An environment, where to run the target R script (default:
_targets.R) ifcallr_functionisNULL. Ignored ifcallr_functionis anything other thanNULL.callr_functionshould only beNULLfor debugging and testing purposes, not for serious runs of a pipeline, etc.The
envirargument oftar_make()and related functions always overrides the current value oftar_option_get("envir")in the current R session just before running the target script file, so whenever you need to set an alternativeenvir, you should always set it withtar_option_set()from within the target script file. In other words, if you calltar_option_set(envir = envir1)in an interactive session and thentar_make(envir = envir2, callr_function = NULL), thenenvir2will be used.- script
Character of length 1, path to the target script file. Defaults to
tar_config_get("script"), which in turn defaults to_targets.R. When you set this argument, the value oftar_config_get("script")is temporarily changed for the current function call. Seetar_script(),tar_config_get(), andtar_config_set()for details about the target script file and how to set it persistently for a project.- store
Character of length 1, path to the
targetsdata store. Defaults totar_config_get("store"), which in turn defaults to_targets/. When you set this argument, the value oftar_config_get("store")is temporarily changed for the current function call. Seetar_config_get()andtar_config_set()for details about how to set the data store path persistently for a project.- garbage_collection
Deprecated. Use the
garbage_collectionargument oftar_option_set()instead to run garbage collection at regular intervals in a pipeline, or use the argument of the same name intar_target()to activate garbage collection for a specific target.- use_crew
Logical of length 1, whether to use
crewif thecontrolleroption is set intar_option_set()in the target script (_targets.R). See https://books.ropensci.org/targets/crew.html for details.- terminate_controller
Logical of length 1. For a
crew-integrated pipeline, whether to terminate the controller after stopping or finishing the pipeline. This should almost always be set toTRUE, butFALSEcombined withcallr_function = NULLwill allow you to get the running controller usingtar_option_get("controller")for debugging purposes. For example,tar_option_get("controller")$summary()produces a worker-by-worker summary of the work assigned and completed,tar_option_get("controller")$queueis the list of unresolved tasks, andtar_option_get("controller")$resultsis the list of tasks that completed but were not collected withpop(). You can manually terminate the controller withtar_option_get("controller")$summary()to close down the dispatcher and worker processes.- as_job
TRUEto run as an RStudio IDE / Posit Workbench job, if running on RStudio IDE / Posit Workbench.FALSEto run as acallrprocess in the main R session (depending on thecallr_functionargument). Ifas_jobisTRUE, then therstudioapipackage must be installed.
Value
NULL except if callr_function = callr::r_bg(), in which case
a handle to the callr background process is returned. Either way,
the value is invisibly returned.
Storage access
Several functions like tar_make(), tar_read(), tar_load(),
tar_meta(), and tar_progress() read or modify
the local data store of the pipeline.
The local data store is in flux while a pipeline is running,
and depending on how distributed computing or cloud computing is set up,
not all targets can even reach it. So please do not call these
functions from inside a target as part of a running
pipeline. The only exception is literate programming
target factories in the tarchetypes package such as tar_render()
and tar_quarto().
See also
Other pipeline:
tar_make_clustermq(),
tar_make_future()
Examples
if (identical(Sys.getenv("TAR_EXAMPLES"), "true")) { # for CRAN
tar_dir({ # tar_dir() runs code from a temp dir for CRAN.
tar_script({
library(targets)
library(tarchetypes)
list(
tar_target(y1, 1 + 1),
tar_target(y2, 1 + 1),
tar_target(z, y1 + y2)
)
}, ask = FALSE)
tar_make(starts_with("y")) # Only processes y1 and y2.
# Distributed computing with crew:
if (requireNamespace("crew", quietly = TRUE)) {
tar_script({
library(targets)
library(tarchetypes)
tar_option_set(controller = crew::controller_local())
list(
tar_target(y1, 1 + 1),
tar_target(y2, 1 + 1),
tar_target(z, y1 + y2)
)
}, ask = FALSE)
tar_make()
}
})
}