This data set consists of 93,638 entries from a social network where users write predominantly in Brazilian Portuguese.
It is divided into three classes: 10, 20, and 30, containing users of both genders with ages between 13 and 17, 23 to 27, and 33 to 42 years, respectively. The 6-year interval between the end of an age range and the beginning of another was planned for a clearer differentiation.
Class | Range | Entries |
---|---|---|
10 | 13 to 17 | 55369 |
20 | 23 to 27 | 20958 |
30 | 33 to 42 | 17311 |
Total | 13 to 42 | 93638 |
When using this dataset for academic purposes, please cite our article:
Carvalho, F., Junior, F.P., Ogasawara, E., Ferrari, L. and Guedes, G., 2023. Evaluation of the Brazilian Portuguese version of linguistic inquiry and word count 2015 (BP-LIWC2015). Language Resources and Evaluation, pp.1-20.
@article{carvalho2023evaluation,
title={Evaluation of the Brazilian Portuguese version of linguistic inquiry and word count 2015 (BP-LIWC2015)},
author={Carvalho, Flavio and Junior, Fabio Paschoal and Ogasawara, Eduardo and Ferrari, Lilian and Guedes, Gustavo},
journal={Language Resources and Evaluation},
pages={1--20},
year={2023},
publisher={Springer}
}