Skip to content

franciellevargas/NoHateBrazil

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 

Repository files navigation

DOI

NoHateBrazil: A Brazilian Portuguese Text Offensiveness Analysis System


WEB SYSTEM

http://143.107.183.175:14581/



ABOUT

Hate speech is a surely relevant problem in Brazil. Nevertheless, its regulation is not effective due to the difficulty to identify, quantify and classify offensive comments. Here, we introduce a novel system for offensive comment analysis in Brazilian Portuguese. The system titled “NoHateBrazil” recognizes explicit and implicit offensiveness in context at a fine-grained level. Specifically, we propose a framework for data collection, human annotation and machine learning models that were used to build the system. In addition, we assess the potential of our system to reflect stereotypical beliefs against marginalized groups by contrasting them with counter-stereotypes. As a result, a friendly web application was implemented, which besides presenting relevant performance, showed promising results towards mitigation of the risk of reinforcing social stereotypes. Lastly, new measures were proposed to improve the explainability of offensiveness classification and reliability of the model’s predictions.



CITING

Vargas, F., Carvalho, I., Schmeisser-Nieto, W., Pardo, T.A.S., Benevenuto, F. (2023). NohateBrazil: A Brazilian Portuguese Text Offensiveness Analysis System. Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, pp.1180--1186. Varna, Bulgaria. https://aclanthology.org/2023.ranlp-1.125



BIBTEX

@inproceedings{vargas-etal-2023-nohatebrazil, title = "{N}o{H}ate{B}razil: A {B}razilian {P}ortuguese Text Offensiveness Analysis System", author = "Vargas, Francielle and Carvalho, Isabelle and Schmeisser-Nieto, Wolfgang and Benevenuto, Fabr{\'\i}cio and Pardo, Thiago", editor = "Mitkov, Ruslan and Angelova, Galia", booktitle = "Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing", year = "2023", address = "Varna, Bulgaria", publisher = "INCOMA Ltd., Shoumen, Bulgaria", url = "https://aclanthology.org/2023.ranlp-1.125", pages = "1180--1186", }

FUNDING

SSC-logo-300x171 SSC-logo-300x171 SSC-logo-300x171