Analyse corpora for prefixes and suffixes 1-5 characters long. Outputs csv files using R.


This is some horrible dirty R code I knocked up to export the prefixes and suffixes in a corpus to CSV files. I am a total beginner. The previous version included punctuation in prefixes and suffixes. This one does not.

If your corpus is just a single file, put it in your R working directory.

You need to install the R packages tau and readr.

There are no bad loops or anything like that in here.

The prefixes and suffixes are 1-5 characters long. This means, if you have the prefix "super", you also get "s", "su", "sup" and "supe".
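
To make the 1-5 character rule concrete, here is a minimal sketch in base R of how every prefix and suffix of a word can be listed and counted. It is not the code in this repository (which uses tau and readr); the function names `tokenise`, `affixes` and `count_affixes`, the sample text and the output file name are all made up for illustration, and splitting on anything that is not a letter is only one assumed way of keeping punctuation out.

```r
# Hypothetical helper: split text into lower-case words, dropping punctuation,
# digits and whitespace (assumption: a "word" is a run of letters).
tokenise <- function(text) {
  words <- unlist(strsplit(tolower(text), "[^[:alpha:]]+"))
  words[nzchar(words)]
}

# Hypothetical helper: all prefixes or suffixes of one word, 1 to max_len characters.
affixes <- function(word, max_len = 5, type = c("prefix", "suffix")) {
  type <- match.arg(type)
  n <- nchar(word)
  if (n == 0) return(character(0))
  lens <- seq_len(min(n, max_len))
  if (type == "prefix") {
    substring(word, 1, lens)          # "s", "su", "sup", ...
  } else {
    substring(word, n - lens + 1, n)  # "r", "er", "per", ...
  }
}

# Hypothetical helper: count every affix across a vector of words.
count_affixes <- function(words, type, max_len = 5) {
  found <- unlist(lapply(words, affixes, max_len = max_len, type = type))
  counts <- sort(table(found), decreasing = TRUE)
  data.frame(affix = names(counts), count = as.integer(counts),
             stringsAsFactors = FALSE)
}

# Example run on a made-up one-line corpus.
words <- tokenise("Superman, supper; super!")
prefixes <- count_affixes(words, "prefix")
suffixes <- count_affixes(words, "suffix")

# Written to a temporary folder here; the file name is only an example.
write.csv(prefixes, file.path(tempdir(), "prefixes.csv"), row.names = FALSE)
```

Calling `affixes("super")` gives "s", "su", "sup", "supe" and "super", which is the behaviour described above. A word shorter than 5 characters only gives affixes up to its own length.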

Any problems email me at
