tidyfreud

tidyfreud contains the complete works of Sigmund Freud in a tidy format, ready for use in NLP tasks.

Installation

You can install the development version of tidyfreud from GitHub with:

# install.packages("devtools")
devtools::install_github("gygl/tidyfreud")

Reproduction of text preparation

The source of the data is a PDF containing Freud’s complete works, downloaded from the following website: https://www.valas.fr/?lang=fr. To reproduce the data preparation, clone the repository locally:

git clone git@github.com:gygl/tidyfreud.git

and open the folder as an RStudio project. Then download the file Freud_Complete_Works.pdf and move it to ./data. You can then run all the preprocessing steps with the following command:

targets::tar_make()

All the preprocessing steps are performed by the function:

create_sfreud_complete_work_tibble(path_pdf = "./data/Freud_Complete_Works.pdf")

which takes the path to the PDF file as its path_pdf argument.
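
To show how targets::tar_make() and the function above could fit together, here is a minimal sketch of a _targets.R file. It is not a copy of the repository's actual _targets.R: the target names pdf_file and freud_tbl are hypothetical, and the sketch assumes a version of the targets package that provides tar_source().

```r
# Hypothetical _targets.R sketch; the repository's real file may differ.
library(targets)

# Load the package functions defined under ./R
# (assumes create_sfreud_complete_work_tibble() lives there).
tar_source()

list(
  # Track the PDF as a file so the pipeline reruns when the file changes.
  tar_target(
    pdf_file,
    "./data/Freud_Complete_Works.pdf",
    format = "file"
  ),
  # Build the tidy tibble from the tracked PDF.
  tar_target(
    freud_tbl,
    create_sfreud_complete_work_tibble(path_pdf = pdf_file)
  )
)
```

With a file like this in place, targets::tar_make() would build both targets, and targets::tar_read(freud_tbl) would return the result from the local store.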

The main processing steps are:

import of the text and table of contents from the PDF into R
tokenization of the text into sentences and words
detection of the book/article titles and subtitles
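
The last two steps can be illustrated with a small base R sketch. This is not the package's implementation: the input lines, the table-of-contents vector, and all object names are made up for the example, and it assumes that titles match table-of-contents entries exactly and that the first line is a title.

```r
# Hypothetical stand-ins for the text and table of contents read from the PDF.
toc_titles <- c("The Interpretation of Dreams", "Totem and Taboo")
lines <- c(
  "The Interpretation of Dreams",
  "Dreams are wish fulfilments. The dream-work disguises them.",
  "Totem and Taboo",
  "Taboo is a Polynesian word."
)

# Title detection: flag lines that exactly match a table-of-contents entry,
# then carry the most recent title forward to the lines that follow it.
is_title <- lines %in% toc_titles
title <- lines[is_title][cumsum(is_title)]
body <- data.frame(
  title = title[!is_title],
  text = lines[!is_title],
  stringsAsFactors = FALSE
)

# Sentence tokenization: split at whitespace that follows ., ! or ?.
sentence_list <- strsplit(body$text, "(?<=[.!?])\\s+", perl = TRUE)
sentences <- data.frame(
  title = rep(body$title, lengths(sentence_list)),
  sentence = unlist(sentence_list),
  stringsAsFactors = FALSE
)

# Word tokenization: lower-case, then split on anything that is not a
# letter, apostrophe, or hyphen. One row per word (tidy format).
word_list <- strsplit(tolower(sentences$sentence), "[^[:alpha:]'-]+")
words <- data.frame(
  title = rep(sentences$title, lengths(word_list)),
  sentence_id = rep(seq_len(nrow(sentences)), lengths(word_list)),
  word = unlist(word_list),
  stringsAsFactors = FALSE
)
```

The resulting words data frame has one word per row together with the title and sentence it belongs to, which is the shape most tidy text-mining tools expect. A real pipeline would need more robust rules for abbreviations, page headers, and subtitles.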
