Extract Text from Rich Text Format (rtf) Documents

Wraps the unrtf utility to extract text from rtf files.

Installation

Hello World

The function has only a single function unrtf(). It takes either a local file path or a URL to a word document:

library(unrtf)
text <- unrtf("https://jeroen.github.io/files/sample.rtf", format = "text")
html <- unrtf("https://jeroen.github.io/files/sample.rtf", format = "html")
cat(text)
###  Translation from RTF performed by UnRTF, version 0.21.9 
### font table contains 11 fonts total

TITLE: It is an example test rtf-file to RTF2XML bean for testing

AUTHOR: kissj
### creation date: 17 April 2000 15:34 
### revision date: 19 April 2000 09:34 
### total pages: 2
### total words: 217
### total chars: 1240

-----------------
It is an example test rtf-file to RTF2XML bean for testing

Font size 10, plain text;
Font size 12, bold text. Underline,bold text.
 Underline,italic,bold text. 
Font size 22, plain text.
 Bold text.