WikiCommons
Usage
wt_wikicommons(name, utf8 = TRUE, ...)
wt_wikicommons_parse(
  page,
  types = c("langlinks", "iwlinks", "externallinks", "common_names", "classification"),
  tidy = FALSE
)
wt_wikicommons_search(query, limit = 10, offset = 0, utf8 = TRUE, ...)
Arguments
- name
(character) Wiki name - as a page title, must be length 1
- utf8
(logical) If TRUE, encodes most (but not all) non-ASCII characters as UTF-8 instead of replacing them with hexadecimal escape sequences. Default: TRUE
- ...
curl options, passed on to httr::GET()
- page
(httr::response()) Result of wt_wiki_page()
- types
(character) List of properties to parse
- tidy
(logical) Tidy output to data.frames if possible. Default: FALSE
- query
(character) query terms
- limit
(integer) number of results to return. Default: 10
- offset
(integer) record to start at. Default: 0
Value
wt_wikicommons returns a list, with slots:
- langlinks - language page links
- externallinks - external links
- common_names - a data.frame with name and language columns
- classification - a data.frame with rank and name columns
wt_wikicommons_parse returns a list
wt_wikicommons_search returns a list with slots continue and query.
The query slot holds the results, and query$search contains the
search hits.
References
https://www.mediawiki.org/wiki/API:Search for help on search
Examples
if (FALSE) { # \dontrun{
# high level
wt_wikicommons(name = "Malus domestica")
wt_wikicommons(name = "Pinus contorta")
wt_wikicommons(name = "Ursus americanus")
wt_wikicommons(name = "Balaenoptera musculus")
wt_wikicommons(name = "Category:Poeae")
wt_wikicommons(name = "Category:Pinaceae")
# low level
pg <- wt_wiki_page("https://commons.wikimedia.org/wiki/Malus_domestica")
wt_wikicommons_parse(pg)
# search wikicommons
# FIXME: utf8 = FALSE for now until curl::curl_escape fix
# https://github.com/jeroen/curl/issues/228
wt_wikicommons_search(query = "Pinus", utf8 = FALSE)
## use search results to dig into pages
res <- wt_wikicommons_search(query = "Pinus", utf8 = FALSE)
lapply(res$query$search$title[1:3], wt_wikicommons)
} # }
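The limit and offset arguments described above can be combined to walk through search results one page at a time. The following is a minimal sketch, not part of the package: collect_titles and its search_fn argument are hypothetical names introduced here for illustration, and the code assumes that res$query$search$title holds the hit titles, as in the examples above. It passes utf8 = FALSE, following the FIXME note in the examples.

```r
# Hypothetical helper (not part of the package): gather titles across pages.
# `search_fn` defaults to wt_wikicommons_search, which must be available when
# the default is used; a stub can be passed instead for offline testing.
collect_titles <- function(query, pages = 3, limit = 10,
                           search_fn = wt_wikicommons_search) {
  titles <- character(0)
  for (i in seq_len(pages)) {
    # offset is the record to start at, so page i starts at (i - 1) * limit
    res <- search_fn(query = query, limit = limit,
                     offset = (i - 1) * limit, utf8 = FALSE)
    hits <- res$query$search$title
    # stop early once a page comes back empty
    if (length(hits) == 0) break
    titles <- c(titles, hits)
  }
  titles
}
```

With the package loaded, collect_titles("Pinus", pages = 2) would issue two search requests (offsets 0 and 10) and return the combined titles as a character vector.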
