Get full text research articles
rOpenSci has a number of R packages to get either full text, metadata, or both from various publishers. The goal of
fulltext is to integrate these packages to create a single interface to many data sources.
fulltext makes it easy to do text-mining by supporting the following steps:
Previously supported use cases, extracted out to other packages:
Data sources in
Authentication: A number of publishers require authentication via API key, and some even more draconian authentication processes involving checking IP addresses. We are working on supporting all the various authentication things for different publishers, but of course all the OA content is already easily available. See the Authentication section in
?fulltext-package after loading the package.
We’d love your feedback. Let us know what you think in the issue tracker (https://github.com/ropensci/fulltext/issues)
Article full text formats by publisher: https://docs.ropensci.org/fulltext/articles/formats
Stable version from CRAN
Development version from GitHub
cache_options_set(path = (td <- 'foobar')) res <- ft_get(c('10.7554/eLife.03032', '10.7554/eLife.32763'), type = "pdf") library(readtext) x <- readtext::readtext(file.path(cache_options_get()$path, "*.pdf"))