ft_collect() reads the full text from the file paths stored in your ft_data object (the result of a call to ft_get()). ft_text() is a convenience function that pulls the nested text data up to the top level of the list for easier access.

ft_collect(x, ...)

ft_text(x, ...)

# S3 method for default
ft_text(x, ...)

# S3 method for ft_data
ft_text(x, ...)



x: Input. An object of class ft_data.


...: Further args, ignored.


An object of class ft_data in which the data slot holds a character string of the text from the XML, plain text, or PDF file.


The result of this call is the actual text, which you can read directly.


if (FALSE) {
  # Get some data
  x <- ft_get('10.1371/journal.pone.0086169')

  # note that the data is not in the object, gives NULL
  x$plos$data$data

  # Collect data from the .xml file
  y <- x %>% ft_collect()

  # note how the data is now in the object
  y$plos$data$data

  # Let's get the actual
  ## ft_collect() alone, replaces file pointers with parsed text,
  ## maintaining object structure
  x %>% ft_collect()

  ## pulls the text out of the object
  x %>% ft_collect() %>% ft_text()
}
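The idea of replacing a file pointer with text, and then lifting that text up in the list, can be sketched offline with plain R lists. This is only an illustration under stated assumptions: the nested shape of `toy` is a simplified stand-in for an ft_data object, and `toy_collect()` and `toy_text()` are hypothetical helpers written for this sketch, not the package's implementation.

```r
# Write a small text file to stand in for a downloaded full-text file.
tmp <- tempfile(fileext = ".txt")
writeLines(c("First line of the article.", "Second line."), tmp)

# Toy stand-in for an ft_data-like object (assumed, simplified structure):
# the `path` slot points at a file on disk and the `data` slot is empty.
toy <- list(
  plos = list(
    data = list(
      path = list(`10.1371/journal.pone.0086169` = list(path = tmp)),
      data = NULL
    )
  )
)

# Hypothetical helper illustrating the idea behind ft_collect():
# read each file on disk and store its text in the `data` slot,
# leaving the rest of the object structure unchanged.
toy_collect <- function(x) {
  for (src in names(x)) {
    paths <- x[[src]]$data$path
    x[[src]]$data$data <- lapply(paths, function(p) {
      paste(readLines(p$path, warn = FALSE), collapse = "\n")
    })
  }
  x
}

# Hypothetical helper illustrating the idea behind ft_text():
# lift the nested text up so each source maps directly to its text.
toy_text <- function(x) {
  lapply(x, function(src) src$data$data)
}

is.null(toy$plos$data$data)        # TRUE: no text in the object yet
collected <- toy_collect(toy)
toy_text(collected)$plos[[1]]      # the text read from the temp file
```

Keeping the two steps separate mirrors the split described above: the collect step preserves the object's structure, and the text step trades that structure for easier access.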