R wrapper for antiword utility
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
R
man Update file URL Apr 29, 2017
src Further increase buffer Oct 7, 2018
.Rbuildignore Wording Apr 22, 2017
.gitignore Tweaks for Windows Apr 22, 2017
.travis.yml Fix Travis May 11, 2018
DESCRIPTION Increase some buffers May 10, 2018
NAMESPACE first commit Apr 22, 2017
NEWS Increase some buffers May 10, 2018
README.md Add inactive status badge (#2) May 11, 2018
antiword.Rproj first commit Apr 22, 2017
appveyor.yml Enable CI stuff Apr 22, 2017

README.md

antiword

Project Status: Active – The project has reached a stable, usable state and is being actively developed. Build Status AppVeyor Build Status Coverage Status CRAN_Status_Badge CRAN RStudio mirror downloads

Extract Text from Microsoft Word Documents

Wraps the AntiWord utility to extract text from Microsoft Word documents. The utility only supports the old doc format, not the new xml based docx format.

Installation

devtools::install_github("ropensci/antiword")

Hello World

The function has only a single function antiword(). It takes either a local file path or a URL to a word document:

library(antiword)
text <- antiword("https://jeroen.github.io/files/UDHR-english.doc")
cat(text)