Data

This repository contains the code and data for a paper on the automated extraction of literary genres based on distributions of parts of speech.

Data

The data are taken from the ETCBC.

Experiment

The script experiment.R is set to be run from within RStudio.

Results

The results include a PCA biplot in 2D and one in 3D as well as a correlation plot. They can be inspected in the results folder.

Attribution

Please cite the actual paper in the Journal for North-West Semitic Languages when using this repo.

Johan de Joode, "The Distribution of Parts of Speech in the Literary Genres of the Hebrew Bible: A Digital Stylistic Approach, Journal of Northwest Semitic Languages 46/1 (2020), pp. 67-90

The research for this article was conducted as part of the project The Genes of Genre: Classifying Literary Text Types Using Statistical Modelling (3H180173, KU Leuven, with as principal investigators Eibert Tigchelaar, Pierre Van Hecke, and Dirk Speelman).

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data		data
results		results
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
experiment.R		experiment.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data

Experiment

Results

Attribution

About

Releases 2

Packages

Languages

License

jdejoode/partsofspeech-JNSL

Folders and files

Latest commit

History

Repository files navigation

Data

Experiment

Results

Attribution

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages