Skip to content
R package: Lexicons for Portuguese Text Analysis
R
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
R
data-raw fix non-ascii problem by removing special characters Oct 22, 2017
data fix non-ascii problem by removing special characters Oct 22, 2017
man polarity Dec 27, 2017
.Rbuildignore
.gitignore first commit Sep 21, 2017
DESCRIPTION
LICENSE fix pos submissao Oct 22, 2017
NAMESPACE
README.md changed get_word_sentiment() Dec 27, 2017
lexiconPT.Rproj first commit Sep 21, 2017

README.md

title output
lexiconPT
github_document

lexiconPT: An R package that provides lexicons for Portuguese Text Analysis

lexiconPT was developed to make it easy for analysts and researchers who want to perform text analysis and mining in Portuguese texts in R. Its main contribution is that it easily imports cleaned versions of some famous Portuguese lexicons, such as OpLexicon and SentiLex. These datasets can be loaded to R using the data() function:

data("sentiLex_lem_PT02")
data("oplexicon_v2.1")
data("oplexicon_v3.0")

Also, you can individually check a word sentiment by using the function lexiconPT::get_word_sentiment():

get_word_sentiment("temer")

The cleaning process can be seen in data-raw/setup.R. Note that, in order to clean the SentiLex dataset in a tidy way, some assumptions had to be made (I'm gonna write a blog post about it). On the other hand, both OpLexicon versions were already cleaned.

To know how both lexicons were developed, please see the help pages of the datasets to check their references.

To install lexiconPT, please run devtools::install_github("sillasgonzaga/lexiconPT").

You can’t perform that action at this time.