Este repositório contém as bases de dados e resultados do artigo publicado nos anais do
XX Brazilian Symposium on Information Systems (SBSI 2024), Maio, 2024, Juiz de Fora, Brasil
Abstract • Overview • Conteúdo • Autores
Context: In 2022, tourism earned R$208 billion, 28% more than in 2021, indicating a growing interest on the part of travelers in practical experiences during their trips, as well as creating content such as opinions, ratings, and recommendations. This has led both the public and private sectors to direct their efforts to improve service quality. Problem: Due to the large data volumes, discovering the tourist profile manually becomes impractical, making it imperative to adopt knowledge discovery techniques that automate this process. Solution: We present a solution for identifying the tourist profiles through persona analysis, using hotels from Pirenópolis in Goiás, Brazil, as a case study. IS Theory: The work was developed using the Social Media Engagement Theory about content generated by users on TripAdvisor and Booking.com platforms due to users experiencing a feeling of community, aligning with the central idea of theory highlighting the importance of active engagement. Method: In this study, we adopted the Cross-Industry Standard Process for Data Mining method, a consolidated approach in data mining. Using this method, we conducted exploratory analysis, profile analysis, topic modeling, and the grouping technique, enabling us to prepare a report based on the analyzed data. Summary of Results: The analysis of the topics resulted in identifying ten topics that reveal the preferences and needs of travelers. The application of the grouping technique allowed the creation of personas based on this data, enriching the understanding of the user profile. This results in a purposeful report on products and services best adapted to travelers' needs. Contributions and Impact in IS area: Facing challenges in systems in the era of innovation based on connected open data, this work contributes to this discussion as it dynamically fits, providing an innovative and practical approach to dealing with expanding information, automating data analysis processes, and increasing the operational efficiency of tourism companies in forming strategies to enhance decision-making.
O presente trabalho lida com uma crescente quantidade de dados não estruturados disponíveis em plataformas turísticas, que demandam avanços nos sistemas de informação e tecnologia para lidar com sua heterogeneidade e qualidade inconsistente, extraindo conhecimento útil ao domínio.
Outros pontos relevantes
- Contribuímos para a discussão sobre sistemas na era da inovação baseada em dados abertos conectados Boscarioli et al. 2017;
- Atuamos com o conceito de human-in-the-loop, envolvendo especialistas de domínio em todas as etapas, desde a coleta até a geração de insights;
- O pipeline experimental traz avanços na aplicação de técnicas de mineração de textos, com destaque para a Modelagem de Tópicos baseada no uso de representações word embeddings (e.g., BERT) e algoritmos de clusterização (e.g., K-Means) combinado com o Doc2Vec para a segmentação de clientes;
- Contribuímos na automatização de processos de análise de dados para aumentar a eficiência operacional das empresas de turismo e facilitar a tomada de decisões estratégicas.
- Base de dados: Contém as bases de dados com as avaliações coletadas das plataformas selecionadas (Booking.com e TripAdvisor).
- Análise Exploratória: Resultados da análise exploratória das bases de dados.
Gabriele S. Araújo |
Jonathan Oliveira Fernandez |
Marcelino S. da Silva |
Fábio M. F. Lobato |
@inproceedings{10.1145/3658271.3658340,
author = {Ara\'{u}jo, Gabriele De Sousa and Fernandez, Jonathan Oliveira and Da Silva, Marcelino Silva and Lobato, F\'{a}bio Manoel Fran\c{c}a},
title = {Persona and issue analysis on tourism social media: a case study of Piren\'{o}polis, Goi\'{a}s, Brazil},
year = {2024},
isbn = {9798400709968},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3658271.3658340},
doi = {10.1145/3658271.3658340},
booktitle = {Proceedings of the 20th Brazilian Symposium on Information Systems},
articleno = {68},
numpages = {10},
keywords = {Booking.com, Text Mining, Topic modeling, Tourism, TripAdvisor, User profile},
location = {<conf-loc>, <city>Juiz de Fora</city>, <country>Brazil</country>, </conf-loc>},
series = {SBSI '24}
}