fabiolobato/ENIAC23-SysMapping


This repository contains the database and the results of the article published at ENIAC 2023.

Natural Language Processing and Social Media: a systematic mapping on Brazilian leading events

Abstract

The number of social media platforms has increased significantly, as has the number of active users. More than 18.2 million text messages are transmitted every minute on these platforms. Given this volume, several researchers have used Natural Language Processing (NLP) techniques to analyze the large amount of unstructured data available. It is therefore essential to understand the main trends and challenges of social media analysis. From this perspective, this study presents a systematic mapping of NLP for social media analysis, considering papers published in five well-established Brazilian academic events: BRACIS, BraSNAM, ENIAC, STIL, and PROPOR. The study aims to identify the main tools and techniques used, tasks performed, data sources, and evaluation measures. For this purpose, 186 studies were carefully selected from the 654 papers published in these events over three years (2020 to 2022) and then analyzed. The results offer a glimpse of the current scenario on the subject and point out areas that can be improved in future research, with techniques for tasks such as text classification, sentiment analysis, and named-entity recognition. This work can therefore be helpful for academics interested in exploring the potential of NLP for social media analysis and in gaining a clear view of gaps, challenges, and research opportunities in this area. It should also guide the productive sector in this knowledge transfer, reducing the gap between the state of the art and practice and consequently increasing the competitiveness and innovation of social media analysis tools.

Events

The article is based on a systematic mapping of papers published in the proceedings of five Brazilian scientific events that cover text mining and social media analysis, specifically in the area of Natural Language Processing (NLP):

  • Brazilian Conference on Intelligent Systems (BRACIS)

  • Brazilian Workshop on Social Network Analysis and Mining (BraSNAM)

  • Encontro Nacional de Inteligência Artificial e Computacional (ENIAC)

  • International Conference on Computational Processing of the Portuguese Language (PROPOR)

  • Symposium in Information and Human Language Technology (STIL)

    Source     Selected articles   Years
    BRACIS     43                  2020 - 2022
    BraSNAM    33                  2020 - 2022
    ENIAC      49                  2020 - 2022
    PROPOR     30                  2020 - 2022
    STIL       31                  2021
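
As a quick consistency check, the per-event counts in the table can be summed and compared with the totals quoted in the abstract (186 selected studies out of 654 published papers). The snippet below is only an illustrative sketch; the variable names are not taken from the repository.

```python
# Selected articles per event, copied from the table above.
selected = {"BRACIS": 43, "BraSNAM": 33, "ENIAC": 49, "PROPOR": 30, "STIL": 31}

total_selected = sum(selected.values())
total_published = 654  # total papers published in the events, per the abstract

# Share of published papers that were selected for the mapping.
share = total_selected / total_published
print(total_selected, f"{share:.1%}")
```

The sum matches the 186 studies reported in the abstract, which corresponds to a little over a quarter of the papers published in these events.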

Contents

  1. Preprocessing: Conversion of the database from .xlsx to .csv, standardization, and attribute cleaning.
  2. Exploratory Analysis: Exploratory analysis of the data and generation of graphs.
  3. Wordcloud: Word cloud generation from the data sources.
  4. Datasets: Folder containing the databases with the mapped attributes.
  5. Results: Folder containing the analysis results.
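
The standardization, attribute cleaning, and term counting described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code from this repository: the column names, the `;` separator, the sample rows, and the helper functions `standardize_header`, `clean_rows`, and `term_frequencies` are all assumptions made for the example. The .xlsx to .csv conversion itself is omitted, since it needs a third-party spreadsheet library.

```python
import csv
import io
from collections import Counter


def standardize_header(name):
    """Lowercase a column name and join its words with underscores."""
    return "_".join(name.strip().lower().split())


def clean_rows(raw_csv):
    """Parse CSV text, standardize headers, trim values, drop fully empty rows."""
    reader = csv.reader(io.StringIO(raw_csv))
    header = [standardize_header(h) for h in next(reader)]
    rows = []
    for values in reader:
        trimmed = [v.strip() for v in values]
        if any(trimmed):  # skip rows that carry no content at all
            rows.append(dict(zip(header, trimmed)))
    return rows


def term_frequencies(rows, column, sep=";"):
    """Count how often each term appears in a delimiter-separated column."""
    counts = Counter()
    for row in rows:
        for term in row.get(column, "").split(sep):
            term = term.strip().lower()
            if term:
                counts[term] += 1
    return counts


# Hypothetical sample: the real column names and values are not shown in this README.
SAMPLE = """Source , Data Source,Task
BRACIS, Twitter ; Reddit ,Sentiment Analysis
,,
ENIAC,twitter,Text Classification
"""

rows = clean_rows(SAMPLE)
freq = term_frequencies(rows, "data_source")
print(freq.most_common())
```

Term counts of this kind are the usual input for a word cloud or a bar chart; normalizing case and whitespace before counting keeps variants such as "Twitter" and "twitter" from being counted separately.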

Authors

  • Gabriele de S. Araújo
  • Jéssica Brenda P. Leite
  • Marcelino S. da Silva
  • Antonio F. L. Jacob Junior
  • Fábio M. F. Lobato

Citation

@inproceedings{araujo2023natural,
  title={Natural Language Processing and Social Media: a systematic mapping on Brazilian leading events},
  author={Ara{\'u}jo, Gabriele de S and Leite, J{\'e}ssica Brenda P and da Silva, Marcelino S and Junior, Antonio FL Jacob and Lobato, F{\'a}bio MF},
  booktitle={Anais do XX Encontro Nacional de Intelig{\^e}ncia Artificial e Computacional},
  pages={741--755},
  year={2023},
  organization={SBC}
}
